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PREFACE 



The Conference on Research Trends in Computational Linguistics was 
held in Washington, D.C*, March 14-16, 1972, under the auspices of the 
Center for Applied Linguistics in cooperation with the Association for 
Computational Linguistics and with the support of the National Science 
Foundation. Its purpose was to assess the current situation in compu- 
tational linguistics and to project future trends and needs. 

The invitation list was prepared in such a way as to reflect the 
diversity of current work and to stretch somewhat the traditional scope 
of the field by including phonologists and psycholinguists, whose work 
will increasingly interact with computational linguistics in the years 
to come. The area of linguistic data processing, on the other hand, 
was excluded as adequately treated elsewhere. 

The Conference was organized into workshop sessions; the individual 
groups' discussions were later summarized by the chairmen and presented 
in general sessions for comment and reaction. The task of summarizing 
the views of a workshop group into a coherent report is a difficult one; 
the reports benefit by being tempered by the personal views and knowledge 
of the particular experts chosen as chairmen. Additional comments were 
obtained when the preliminary versions were presented at a special 
session of the Tenth Annual Meeting of the Association for Computational 
Linguistics at Chapel Hill, July 1972. 

"Computational Linguistics and Linguistics", the first plenary 
session, set the tone of the meeting; as Barnes* report shows, some 
agreement evolved on a definition and understanding of the scope of 
computational linguistics. "Integrated Computer Systems for Language" 
were also discussed in a plenary session. Its chairman, Simmons, 
summarizes the discussion, adds a personal statement of his view of the 
field, and appends an outline of suggestions for future research. 

Three substantive subject areas were discussed at length by smaller 
groups. Almost all of the participants in Petrick*s sessions on 
"Computer-Oriented Grammars and Parsing" have at one time or another been 
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engaged In producing such gramnars. His report Includes first-hand 
summaries of current work. As Cooper notes, 'Machines and Speech" was 
on the agenda not because of past performance but because of future 
expectations. Reading machines for che blind and speech understanding 
systems form the Initial points of contact; Cooper spells out progress 
to date and lists research needs. "Language Performance" constitutes 
another topic of potential Interest to the field. Carroll surveys the 
past applications and concludes that present work in computational 
linguistics can benefit psychological and psychollngulstlc studies. 

Important non-technlcal questions were also discussed. Walker's 
essay on "Social Implications of Automatic Language Processing" reflects 
concern lest computational linguists' work be co-opted for anti-social 
purposes; overpowering this concern there is stress on the positive 
implications of the field. Sedelow's session treated the mixed topics 
of "Professional Ethics, Standards, and Education". 

The preceding paragraphs present a brief outline of the content of 
the meeting and the resulting reports. It is Important to add that 
although the participants in the meeting expressed many diverse views, 
there was a pervading feeling of common Interest and optimism for the 
future of computational linguistics. The research problems facing the 
field are difficult but imnensely interesting; recent developments 
leave computational linguists hopeful. 
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COMPUTATIONAL LINGUISTICS AND LINGUISTICS 
Robert F. Barnes, Jr., Chairman 



As V*) clous participants began to express their views on the 
nature and stratus of Computational Linguistics (CL) and on its 
relation to Linguistics In general. It seemed at first that a con- 
sensua might be a hopeless goal* But as the discussion continued, 
two things gradually became clear: (1) that CL does Indeed comprise 
a broad range of studies, activities, and techniques; and (2) that, 
as a result, apparent disagreement between differing views on CL Is 
often simply a reflection of differences In relative emphasis among 
these areas • 

For example, to computational linguists concerned chiefly with 
computer Implementation of natural-language grammars, CL seems 
largely to be an "experimental branch** of Its parent field. But 
further discussion brought out clearly that CL Is far from being 
merely the linguistic counterpart of, let us say, experimental physics; 
a number of genuinely theoretical areas and questions exist In CL, too 
— theorems relating formal grammars to automata are but one example. 

Other speakers suggested an opposite extreme that the entire field 

of linguistics Itself (at least Insofar as its empirical aspects are 
concerned) is essentially computational in its reliance on the computer. 
From this point of view, there is no distinction at all between the two 
fields, and the very term **computational linguistics** seems distinctly 
odd — as if we spoke of **microscope biology'* or **slide-rule engineering**. 
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But in some areas of Unguis tics (especially the more traditional 
ones), computers are used as convenient tools, which in principle, 
could be replaced with an army of slaves (or perhaps graduate 
assistants). In other areas, however, computational notions play a 
more conceptually central rolu. Thus, without completely excluding 
from CL such activities as the development and application of computer 
procedures to perform traditional linguistic activities, we can 
distinguish a narrower core area in which notions from mathematics 
and computation interact crucially with those from linguistics. The 
central focus of this interdisciplinary area is the linguistic (or 
more generally, symbolic) algorithm, and it is with the structure 
and application of these that CL is peculiarly concerned. 

It should be noted that this concern makes CL a genuine inter- 
disciplinary field in its own rignt, not merely a sub-field of either 
linguistics or computation. Indeed, some of the particular concerns 
of CL may go considerably beyond what is customarily seen as lying 
in either of these spheres of interest, A computational linguist 
working on problems in compiler syntax and semantics, for example, 
might well employ concepts, techniques, or results typical of CL 
without being seen as doing work in either linguistics or mathematics. 
Another computational linguist (or indeed, even the same person) 
tiight also be working on algorithms directed toward the confirmation 
or d is confirmation of some purely linguistic theory; such an activity 
may be quite machine*-f ree and may not at all be seen as belonging to 

the field of computation either in theory or practice. Furthermore, 

certain applications of CL techniques, such as automatic f^tyle- or 
content-analysis of natural-language source material, belong even less 
uniquely either in lingi v,tics or in computation. Still more remote 
from both fields are such activities as the application of generative 
grammars to musical composition — that is, natural extensions of certain 
indisputably computational -Unguis tic studies which may well be seen as 
constituting peripheral areas of CL. But throughout this range of 
activities, the essentially algorithmic nature of CL gives it a thorough- 
going air of rigor which is not always present in the parent field of 
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linguistics; and Its central concern with the symbolic nature of Its 
source material distinguishes It from the general field of computation. 

It should also be noted here that though almost everyone feels 
some degree of uneasiness with the term "Computational Linguistics" 
Itself, no one seems able to offer a clearly superior substitute. 
Broader terms have been suggested but they seem no less vague and 
indefinite, and sometimes are so broad that it would be hard to say 
what is not CL. It may well be that for our purposes a vague, ill- 
defined, but relatively narrow term is better than a vague, ill-defined 
broad one • And in any case, as a field thrives and grows, the origins 
of its title are of considerably less importance than its Identity as 
a healthy intellectual discipline. 

As the group's discussion continued, there seemed, therefore, to 
come into focus (in addition to the sort of computational assistance 
utilized by trn^lltional linguistic studies) two characteristic sorts 
of activities ^;:n)v: perhaps implicitly define CLr— (1) the application 
of linguistic r^u^wepts and techniques to problems dealing with computers 
and computation, and (2) the algorithmic interpretation or realization 
of linguistic problems or theories. (Sometimes, of course, as in the 
development of grammar-testing routines, a single activity may be seen 
as belonging to both.) In each of these areas, but especially in the 

latter, CL takes on a special character as a discipline neither fully 

theoretical nor fully empirical, but serving as a rigorous "bridge 
theory" connecting other, more abstract theories with the specific 
facts they are to account for. It is in just such a role, for example, 
that studies in CL may prove helpful in connecting the notions of 
linguistic "competence" and "performance" in an enlightening, yet 
formally precise way. 

It has been generally accepted since de Saussure that linguistics 
need not concern itself only with describing actutil utterances, with 
their idiosjmcratic ranges of linguistic features, many of them 
"accidental". But exactly which features are seen as accidental and 
which as essential depends very greatly upon one's underlying linguistic 
"paradigm" (to use Thomas Kuhn's helpful concept). A key Issue between 




the pos t-Chotnsklan and pte-Choinsklan paradigms for linguistics centers 
on the role of performance In a linguistic theory. The one sees 
performance as empirical data — evidence for or against an abstract 

theory of competence; the other sees It as an empirical phenomenon 

the very subject matter to be accounted for by the theory. (John 
Searle, In the New York Review of June 29, 1972, puts this distinction 
very nicely.) 

Now, when theory and data fall to match. It need not be the 
theory which Is rejected; 1^ one can provide a systematic explanation 
of the mismatch, the theory and data can still be seen as compatible. 
The abstract theories of the transformationalists (as well as the 
transformationalists themselves) are often criticized on their seeming 

disregard of facts of linguistic usage the performances we encounter 

every day. The transformationalists generally respond that accidental 
features of performance have created apparent Inconsistencies, which 
can ultimately be explained at some later point in the theoretical 
development. Who has the better part of the argument? In the absence 
of some systematic way to relate theoretical competence to empirical 
performance, It's hard to say. It seems possible, though, that the 
rlc,orous nature of CL, structured as It is about algorithms (which can 
be seen as Idealized performances of a certain sort), may offer exactly 
this needed connection between competence and performance. 

As one clarifies the nature of CL, one almost Inevitably explores 
what Its future prospects may be, as shown in the preceding remarks. 
In this direction, a brief glance at the history of the field also 
reveals two significant facts which suggest, perhaps, where CVc most 
fruitful development may lie. First, over the past years, one can 
note a definite trend away from narrow studies oriented solely toward 
particular appllcatlonal goals. In part, this represents the maturing 
of the discipline as Its own characteristic problems emerge and 
Increasing attention Is directed to them. However, there seems to be 
an additional feature whose significance is emphasized by the si^cond 
point — namely, that these earlier "miss lon-orlented'* studies wet-e 
often conducted in a framework of (then-existent) theory, often brought 




in from outside the discipline, which has since been recognized as 
inadequate. (What this says in general about the prospective adequacy 
of specific, mission-oriented theory may be left to speculation.) More 
recently, however, one can identify a rich stage of theory arising from 
within CL itself. The theories of this period are often considerably 
broader than the older ones; they often span the entire syntactic- 
semantic-pragmatic spectrum. 

Some of this new theory in CL appearo to be relatively "conven- 
tional" in the sense that it represents new advances along previously 
established theoretical lines; new parsing procedures for context-free 
grammars would perhaps serve as examples of developments of this sort. 
Many of the new developments, however — particularly in areas associated 

with semantic information processing have involved the formulation of 

their own theoretical structures. For example, a long-standing framework 
for the study of language, widely accepted in logic and the philosophy 
of language, redts on a threefold distinction between syntax, semantics, 

and pragmatics roughly, the theories of form, meaning, and use. Chomsky' 

early arguments against basing grammatical theory on meaning, as well as 
the then-prevalent structuralist attitude toward meaning, were in part 
based on this distinction. More recently, however, there are indications 
that an adequate theory of syntax may require semantic and pragmatic 
components of some sort, but the proper theoretical organization of such 
components is still very uncertain. In CL, on the other hand, some of 
the most successful work in semantic information processing has tended 
not to presuppose the syntactic-semantic-pragmatic distinction, or else 
to utilize other distinctions which might be vaguely similar but which 
are made on somewhat different grounds. In these and a number of similar 
instances, CL has begun to show an intellectual maturity and self-suffi- 
ciency which is genuinely promising. 

As a result of this increased theoretical development, one may 
expect CL not only to concern itself with its "own" questions and 
problems, but also to contribute toward broader areas in linguistics it 
has tended to neglect in the past. Computational implementation of 



phonological theories — in either generative or analytical programs — 
may provide fruitful contributions to work on speech synthesis and 
recognition, as well as valuable empirical test procedures for more 
general phonological theories. Developments in computational seman- 
tics (such as those previously mentioned) may offer mechanisms for 
an apparently needed greater theoretical richness and structure in 
semantics, as well as for applications in grammar and phonology. 
Additionally, by presenting its own new viewpoints toward such orthodox 
linguistic distinctions as performance vs. competence, declarative vs. 
imperative sentence, syntax vs. semantics vs. pragmatics, etc., work 
in CL may contribute considerably to the clarification of the founda- 
tions of linguistics. 

(Though they weren't mentioned explicitly in the meeting, David 
Hays's two recent papers on the nature of CL*, baseJ on remarks made 
at the 1971 International Meeting in Hungary, are so to the point that 
they must be mentioned here. While the viewpoint presented there is 
not precisely that given here, the two seem more complementary than 
contradictory; and the papers should be read by all concerned with the 
future of the field.) 

In sum, CL is not linguistics — in whole or in part. Its concern 
is legitimately far broader than natural language, though much of its 
attention is directed toward problems involving automatic processing 

of natural language material. Its central focus the notion of 

symbolic algorithm — represents a standard in rigor and precision which 
general linguistic procedures do not always attain. On the other hand 
it is quite clear that CL does have close historical, conceptual, and 
practical ties to linguistics; and although we expect CL to develop in 
such a way as to make more precise its own characteristic nature, we 
also expect the relationship between the two fields to remain close 
and mutually profitable. 



*'*rhe Field and Scope of Computational Linguistics" and "The Past, 
Present, and Future of Computational Linguistics", available from 
the author, Department of Linguistics, State University of New York 
at Buffalo. 



^- 13 



INTEGRATED COMPUTER SYSTEMS FOR LANGUAGE 
Robert F. Simmons, Chairman 



Most recent and notable among Integrated systems for language 
processing are Woods* Natural Language Retrieval System and Wlnograd's 
system for understanding commands to a robot for operations on a 
limited world of colored blocks. Other examples of more or less 
successful Integrated language processing systems Include mechanical 
translation, natural language data manipulation, robot command systems, 
natural language CAI, semantic analysis of medical data, the analysis, 
synthesis and testing of linguistic rules, text analysis, paraphrase 
and question answering, and text generation* The current Advanced 
Research Projects Agency (ARPA)-sponsored research toward useful speech 
recognition systems Is the uost recent large scale approach to Inte- 
grate knowledge of semantics, syntax and phonolog> Into a capability 
for computer understanding of a useful spoken vocabulary. From this 
Inventory of examples we can Infer that an Integrated language process- 
ing system Is a processor that organizes a set of component linguistic 
processes Into an effective and potentially practical language process- 
ing device. The components may Include speech analysis and synthesis, 
syntactic, semantic and logical operations* 

One aspect of Integrated language processing can arbitrarily be 
dlmenslonallz^ed as the vertical Inter-relation of morphological, 
syntactic, semantic and pragmatic Information. On this dimension. 
Interaction among levels Is used to resolve ambiguities and to clarify 
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understanding. The most thorough example currently available as a 
working program appears to be Wlnograd's system that uses syntactic, 
semantic, and pragmatic information to resolve possible ambiguities 
of English commands and questions. The complexity of Integrated 
systems in their vertical dimension is such that a single person can 
hardly hold all aspects of their components' interaction in his mind 
at one time. This level of complexity requires the use of high level 
languages such as LISP, PLANNER, etc., with the consequence that 
computation is fairly slew and fast random access memory requirements 
are quite large. Alternatively, such a system may be built as a team 
effort, with its attendant communication problems among team members 
and subprograms. The requirement for careful documentation of such 
systems was emphasized and suggestions emerged for the use and 
development of text editing and automated documentation-program 
packages. Automated flow-charting programs were suggested for appli- 
c^Jtion to FORTRAN, ALGOL, and PL/I type programs, and systems for 
displaying the calling structure of functions and other organizational 
structures of LISP systems were briefly described. 

A second aspect of integrated systems can arbitrarily be dimen- 

slonalized as horizontal. On this dimension problems occur at the 

interface of a given language proces«»ing system with the user and with 

other systems. Careful human engineering of the user interface is 

* 

required to enable the convenient insertion of lexical entries, gramma 
rules and semantic and pragmatic information as well as the testing of 
these data for consistency and accuracy. The need for fast on-line 
intera<::tive consoles is most obvious here. Problems concerned with 
evaluation of the effectiveness of language processing systems and 
their generalizability emerge when considering horizontal integration. 
These areas are suggested for continued research. 

A contrast emerged between what is currently desirable and what 
is minimally necessary in the way of hardware for research and develop 
ment of integrated language processifing systems. Ideally, 'iomething 
like the following configuration of hardware is desired: 
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INPUT 


CENTRAL PROCESSOR 


OUTPUT 


Character Reader 
Remote Quiet Terminals 
Voice Input 
Graphics 


Core Memory i-1 million words 
Extended Core 1 million words 
Disc 10 million words 
Networked Miniprocessors 


Print 
Voice 
Graphics 



*numbers signify order of magnitude only 



Software support should include a multi-access timeshared system, basic 
languages such as FORTRAN, ALGOL, PL/I, LISP, and SNOBOL. Specialized 
text editing, parsing, generation and question answering program systems 
would serve as a computational library for the system. 

In constrast to the ideal system designed to support integrated 
language processors, the minimum requirements can already be found at a 
number of locations around the country in existing computing and commu- 
nication networks. The Massachusetts Institute of Technology AI labora- 
tory; Bolt,Beranek & Newman Inc.; Stanford University; Carnegie-Mellon 
University; University of Texas; Stanford Research Institute; and the 
ARPA computing research network are among a number of computing facili- 
ties adequate to the purpose. Unfortunately, adequate access to such 
facilities is administratively limited and very expensive. 

A significant question was raised about hardware requirements 
which led the discussants to conclude that present day hardware systems 
are theoretically adequate to support research in computational linguis- 
tics but, in fact, the research user typically commands less computer 
utility in terms of central processors and memory than he did in the 
mid-sixties. The reason is that despite the truly impressive increases 
in size of memory and processor efficiency, the computing utility is 
shared much more widely, usually reducing the amount of utility that 
[• each person can command. The consequence appears to be that computa- 

i tional linguists often find it difficult to obtain sufficient computing 

i 

• power to construct large integrated systems. For the moment, this lack 

I does not block research; but it does bias the activity toward the 
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development of component software rather tnan large integrated systems 
which require more computation utility than Is readily available to the 
researcher • 

The suggestion was made that computational linguistics research 
might follow the path customarily used by atomic physicists In concen- 
trating expensive equipment at a few centers. The ARPA network Is a 
good example of this approach In the related field of artificial intel- 
ligence research, and it might well be recommended for our own attempts 
to develop practically useful integrated language processing systems. 

The foregoing paragraphs represent the essence of the morning's 
discussion on Integrated Systems in computational linguistics. Behind 
this discussion lay a considerable amount of common knowledge of the 
current state of the art, the research problems and the implications 
of integrated systems for the future of the discipline. In this section 
an attempt is made to render some of this implicit background explicit 
to the reader. The effort is of course significantly biased by the 
author's perception of the situation. 

First it appears that there has been significant development of 
syntactic analysis procedures over the past few years. Petrick's report 
outlines the state of the art in terms of the many parsers that are 
currently more or less well-developed at various locations. One may 
note in particular that many of the mature projects such as those re- 
ported from Europe, the still-healthy mechanical translation project of 
Lehmann and Stachowitz (1972), and numerous newer language processing 
systems around the country, including those of Winograd (1972), Thompson 
(1964), Carbonnel (1970), Heidorn (1971), and Sinmons (1972), did not 
pattern themselves on the strict transformational model. Some systems 
such as those of Woods (1970) and Kellogg (1968), while originally 
based on transformational theory, reveal their transformational origin 
mainly by their use of deep syntactic structures and some use of 
selection restrictions and semantic markers. 

Placing these observations in the context of: (1) the rapid 
development of case structure theory, (2) Winograd 's application of 



Halllday's (197.0) syntactic approach, and (3) the computational develop- 
ment by Woods (1970) and by Bobrow and Frr.ser (1969) of the Thome, 
Bratley and Dewar (1968) use of a finite state network approach to 
grammar, it is possible to see the emergence of a new theoretical 
framework for computational linguistics. This approach is somewhat 
eclectically based on the use of some form of procedural grammar that 
is represented as rule sets (see Heidorn), as augmented finite state 
nets (see Woods), or as procedures (see Winograd, Schank). 

For structures to represent the underlying meaning of sentences, 
three approaches are quite prominent. To the extent that a phrase refers 
to some object in a data structure, it is quite valuable to follow the 
procedural semantics approach described most thoroughly by Woods (1968). 
In this approach the meaning of a phrase is the value returned by a 
procedure that uses elements of the phrase to identify a data object or 
set as its referent. 

Winograd carried the procedural semantics approach even further by 
interpreting relational words such as verbs and prepositions as proce- 
dures in a robot command language. The MicroPlanner Interpreter of this 
* language is sophisticated enough to accomplish quite significant infer- 
ences. Consider, for example, the command, "Put the red block in the 
blue box", which translates into a procedural expression such as, 
(PUT Bl B2). If the blue box, B2, already contains an object, the PUT 
function is able to call other functions to temporarily release the 
block, Bl, remove the object from B2, then pick up Bl again and finally 
succeed in putting it in the box. 

A second approach that is theoretically satisfying but computation- 
ally cumbersome is to represent the meaning of a sentence as a statement 
in symbolic logic. Such a statement is then taken as a theorem whose 
truth value with reference to a set of axioms that comprise the data 
base can be determined by use of resolution techniques. (See Green 6e 
Raphael, 1968; Palme, 1971; and Sandewall, 1970). 

A third approach — one that is applicable to text analysis is to 

resolve the sentence into a deep-case representation which is conceptual 
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Ized as an event description in terms of a verbal relation and its 
arguments « The meaning of this event description is a series of pro- 
positions implied by the event. (See Schank, Lehmann and Stachowitz, 
and Sinsnons.) 

Since the third approach is rather new, it would best be illus- 
trated by the following example: 



The man sailed a boat from the lighthouse to his dock. 



A form of its deep case structure is as follows: 

MOVE via sail 

CAUSAL ACTANT 1 the man 

THEME boat 
SOURCE the lighthouse 

GOAL his dock 



Some of the propositions implied by this structure can be seen in the 
diagram of Figure 1. Each node-arc-node in that diagram represents 
such a proposition. For example, Causal Actant of MOVE** shows 

that the man instigated the action; '"MOVE THEME BOAT** shows the boat 
received the action; *^iAN Possess DOCK** represents the meaning of 
**hi8 dock**; *'BOAT At, tl LIGHTHOUSE** shows an initial state jof the 
event; and *^N At, t2 DOCK** shows that the position of the man at the 
end of the event was that of the dock. The other arcs can be interpre- 
ted in similar fashion. The entire set of such arcs provides a first 
level of inferential meaning for such purposes as question answering 
and problem solving. 

These three approaches to representing the meaning of English 
sentences are quite obviously the current computational nominations 
for semantic deep structures underlying the surface form of sentences. 
Each has proved useful in some integrated system application. There is 
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Graph of Sentence Meaning 
Figure 1 
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enough similarity among the approaches to suggest that they may In 
fact be different points of view and that a given semantic structure 
has procedural, logical and proposltlonal descriptions. 

The means for computing these underlying structures are remark- 
ably similar at a certain level of abstraction. The surface structure 
of an English sentence Is analyzed with the aid of a grammar, a 
lexicon and a parsing algorithm to produce noun phrase and verb phrase 
components. Either at the point of recognizing a phrase, or later, a 
series of operations is applied to transform each phrase into the 
desired form of semantic structure* Although syntactic analysis re- 
mains a difficult, indeterminate, recursive process, the clear defini- 
tion of the desired semantic structure is used successfully to simplify 
the process. The validity of the resulting semantic structure is 
measured by its usefulness in question answering, paraphrase, and 
inferential capabilities. Rationalist arguments about the exact form 
and time of application of transformational rules are essentially 
irrelevant in this empirical approach, and the definition of semantic 
deep structure is influenced most strongly by pragmatic considerations. 

What we see in looking briefly at the development of integrated 
systems for understanding English is that computational linguistics is 
scientifically past its adolescence. While not being "school-bound", 
it is nevertheless deeply influenced by such syntactic theorists as 
Harris, Chomsky, Halliday, Thorne and others. It reflects semantic 
theories of Tarski, Camap, Katz, Woods, and most recently Fillmore, 
Chafe and Grimes. Its methods clearly demonstrate the usefulness of 
computational theories of Thompson, Kuno and Oettinger, Woods and 
others. The field is also fortunate in that a group of psychologists 
are exploring the applicability of its case models and procedural 
semantic structures as models of human memory processes. (See Frijda, 
1972; Collins and Quillian, 1969; Norman, 1972; and Frederikson, 1972; 
among others.) And Carroll, in his report, suggests other valuable 
relations between computational linguistics and psychology. 




But post-adolescence Is not maturity. Computational realizations 
of natural language processing capabilities are impressive in their 
ability to understand small subsets of natural language but a great 
deal of development is required to achieve practical applications. 

Most existing systems are dealing only with single sentences and 
only the most recent few resolve some simple cases of pronominal and 
anaphoric reference. Wilks (1968), Heidorn (1971), and Su (1971) have 
studied paragraphs or larger units of text, but these appear to be 
initial exploration.i of a very large area of computational linguistic 
research. Few of the systems have utilized peripheral storage to allow 
for growth of the large lexicons and grammars that are usually required 
for practical applications. The task of producing large, consistent 
lexicons and grammars has only occasionally been attempted. (See Woods, 
1970; Sager, 1970; and Lehmann and Stachowitz for examples of such 
attempts .) 

The capability to analyze unedited text essential for many 

mechanical translation and information retrieval applications7--is far 
out of reach at present, depending on the development of large computa- 
tional lexicons and grammars for a large stable system with much use of 
peripheral storage. An area in which very little work has so far been 
accomplished is that of developing convenient means for nonlinguists to 
communicate lexical and grammar structures to the system. 

But applications of this post-adolescent field of computational 
linguistics constitute the most active area of research. Early work on 
stylistic and content analysis (see Sedelow, 1969; Stone, 1962) has 
resulted in tools frequently used by behavioral scientists and humani- 
ties scholars for a wide variety of purposes. Another early development 
of keyword analysis of sentences is still a major tool for information 
retrieval applications. Several efforts at natural language data 
management (see Woods, Thompson and Kellogg for examples) have explicitly 
practical goals. A most recent applications area is that of Computer 
Assisted Instruction with studies by Wexler (1970), Carbonnel (1970), 
Brown (1972), Simmons (1972), and others. The attempt in this area is 
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to use language processing and question answering technology to develop 
some approximation to a tutorial system that can understand a student's 
questions and statements, and his answers to the system's questions. 
The systems available at this writing are purely experimental — some- 
times impressive in the depth of their understanding of a few concepts 

but lacking both breadth of language processing ability and depth of 
peripheral memory-system engineering. Even more important, almost 
nothing has so far been learned on an experimental basis of how to use 
a language processor as part of an effective tutorial system. A pressing 
need is upon us to substitute experimental evidence for armchair theo- 
rizing about the educational use of natural language tutorial systems. 

Recommended Research Emphases 

iThe foregoing outlines the area of integrated systems research as 
central to the discipline of computational linguistics which is based on 
syntactic and semantic theories of classical linguists and logicians and 
computational theories of its own. It has been described as an empiri- 
cally oriented area with psychological implications for understanding 
human thought processes and one with potentially great — but still 
unrealized — applications to socially valuable systems. The meeting's 
discussion on integrated systems was rich in its suggestions for research 
and in its discussion of current difficulties in accomplishing it. Other 
areas requiring research become apparent in considering the field's state 
of the art. 

An attempt to summarize the group's suggestions in outline form is 
presented below. Each of the subtopics represents an area where research 
or developmental attention can be expected to advance the state of the art. 

I. Managing Complex systems 

1. High level languages 

2 . Team efforts 

a) Communications among members 

b) Communications among system components 



3. Documentation and programming aids 

a) Text and file editing 

b) Automated flow-charting systems 

c) Displays of structure of subsystems 

II. Interfaces 

1. User 

a) Convenient methods for inserting data 

b) Testing, revising data 

c) Evaluation of language processor (including choice of 

standard dimensions for evaluation) 

2. Peripheral systems 

a) Speech analysis 

b) Speech synthesis 

c) Peripheral memory devices 

d) Text scanners 

III. Computing utility 

1. Central repository of systems, data, algorithms 

2. Distant and local access to a very large computing system 

(or network) 

IV. Theoretical research 

1. Syntactic structures for computation 

2. Semantics of case structures, procedures, etc. 

3. Psychological studies of memory structures 

4. Logical structure of sentence meanings 

5. Speech 

V. Computational linguistic research 

1. Discourse structure 

2. Anaphoric reference 

3. Computational structure of the English lexicon 

4. Development of computational lexicon and grammar for large 

subsets of English (and other languages) 

5. Computational structure of speech 
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VI. Systems research 

1, Large language processing systems with Indefinitely extendable 

peripheral storage capabilities 

2, List processing languages that can effectively use peripheral 

storage devices 

VII. Applied natural language systems 

1. Computer assisted instruction 

2. Text-based question answering 

3. Data-based question answering 

4. Natural language systems for describing algorithms and processes 

to a computer ) i.e., Natural Language Programming Systems 

5. Systems for conversing with a user about a topic 

6. Systems for generating essays from a data base 

7. Simulations of personality and belief structures 

VIII. Inferential capabilitic&i 

1. Verbal problem solving 

2. Theorem proving 

3. Deductive question-answering 

4. Paraphrase 

IX. Esoterica 

1. Emotional content of language and of speech 

2. Interface with visual data 

3. Interface with auditory data 

4. Styllstlcs, rhythm, rhyme, etc. 

5. Semantic classification systems 
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COMPUTER-ORIENTED GRAMMARS AND PARSING 
S.R. Pe trick. Chairman 



General Considerations 

The first of the three sessions devoted to this topic began with an 
attempt to determine what is implied by the phrase "computer-oriented 
grammars". One interpretation suggested by some participants was that 
computer -oriented means "well-specified". One can utilize a grammar in 
a computer system only if one has stated explicitly how that grammar 
designates certain strings as sentences and assigns structure(s) to them. 
The complete specification of a grammar or a class of grammars is not 
directly tied to computers in any fashion. Nevertheless, an indirect 
association is often made for at least two reasons. First, computational 
linguists have of necessity been obliged to make vague or unspecified 
aspects of a class of grammars precise even in cases where a linguisti- 
cally well-motivated defense of their choices was not attempted. Such 
an approach is useful in that It brings to linguists' attention problems 
and required choices that they may have overlooked. Second, in those 
cases where a linguist actually writes a non-trivial set of rules which 
specify a self-contained subset of some language, the use of a computer 
Is required to verify that his grammar does what was Intended. This means 
not only checking that representative sentences are indeed generated and 
assigned desired structures, but also verifying that ungrammatical sen- 
tences are excluded and that unwanted structures are not assigned. A 
computer Is absolutely essenticl for accomplishing this task. Writing a 
set of linguistic rules, i.e., a grammar, is very much like writing a more 




conventional computer program. Just as In the case of a computer program, 
it is virtually certain that an explicit grammar, as first written, con- 
tains errors, and those errors can be removed only through a process of 
extensive ''debugging**. It Is also likely that along with a host of tri- 
vial, uninteresting errors, the debugging process will also turn up some 
more serious, fundamental conceptual errors. 

A second possible Interpretation of the term **computer -oriented 
grammar** that was suggested was **amenable to some task related to com- 
puters**. We have already accounted for the case In which the task Is 
related to the needs of a linguist to explore the adequacy of his 
grammars. The task can also be one arising from an application outside 
of linguistics. Examples Include natural language question answering 
systems, programming systems, computer aided instruction systems, and 
information retrieval systems. There is, of course, no necessity that 
the task be so specific. **Computer-oriented** could simply be taken to 
imply amenability to such a function as parsing or semantic interpreta- 
tion. 

The participants of the first session saw no reason to agree upon 
a choice between these or other possible interpretations of the phrase, 
"computer -oriented grammars*'. Instead, there was general agreement on 
an extensional meaning of the phrase, defined by a set of individual 
efforts . 

Before proceeding to a discussion of those efforts, however, an 
attempt was made to provide a taxonomy which might be useful in grouping 
the granmars for discussion and in exploring their various properties. 
The following types of grammars were cited and very briefly discussed: 
context-free grammars and weakly equivalent grammars, such as categorial 
grammars and dependency grammars; context-sensitive grammars; general 
rewriting system grammars; transformational grammars based on either 
underlying strings or trees; augmented transition network grammars and 
procedural grammars; tree automata; and stratifica tional grammars. 
These classes of grammars were discussed in terms of such attributes as 
the adequacy of the structural descriptions they assign and the coverage 
of English or some other natural language which has been attained through 



their use. It was pointed out that coverage pm6 adequacy of assigned 
structures are not Independent. Some of the most successful ?,rammars 
In terms of sheer coverage (e.g., context-free grammars) have been 
least successful in assigning acceptable structural descriptions. It 
was also pointed out that adequacy of assigned structures is a matter 
of considerable controversy in linguistics. Linguists can debate at 
great length whether a given structure is an adequate representation 
of a sentence; however we cannot begin to consider this issue meaning- 
fully until we have specified the semantic component which is to 
interpret the structure in question. 

In the context of such applications as machine translation, 
question answering systems, natural language programming syatems, and 
information retrieval, there is much less uncertainty about whether a 
given sentence is treated in an acceptable manner. One would thus 
expect the capabilities of these systems to be well-documented. How- 
ever, the facts do not bear this out. It is, in general, impossible 
to assess from published accounts of natural language processing systems 
the limits of the subsets of English which are currently treated satis- 
factorily by their syntactic and semantic components. Syntactic cover- 
age is particularly difficult to determine. It appears unreasonable, 
to be sure, to require the writer of a formal grammar to specify the 
language of his grammar in an alternate, easier to understand, way. 
It is certainly unreasonable to require a complete and precise alternate 
specification. Nevertheless, some approaches to making possible the 
evaluation of natural language system capabilities were suggested. 
Allowable syntactic constructions can be listed, of course. It is 
virtually meaningless, however, to simply list such constructions as 
quantifiers, comparatives, and coordinate structures because no one 
handles them in all of their complexity. Copious examples of essenti- 
ally different sentences that are allowed should be presented. In 
addition, gaps and inadequacies of grammars, also illustrated by 
examples, should be given whenever possible. What a grammar excludes 
or treats in a less than satisfactory manner is just as important infor- 
mation for evaluation as what it handles nicely. Kuno's description of 




his predictive analysis grammar (12) is to be cited as an example to 
be emulated in this regard. 

There is a natural tendency, upon observing that two different 
constructions are provided for in isolation, to assume that the two 
can be combined in a single sentence, perhaps in a number of differ- 
ent ways. Unfortunately, this is an assumption that Is often not well- 
founded, and originators of natural language processing systems should 
take pains to address themselves tcr this point when assessing the 
capabilities of their systems. 

Alternate characterisation of a natural language subset can at 
best aspire to give an interested user or evaluator a rough feel for 
that language. Questions as to the acceptability of specific sentences 
can, of course, be resolved only by recourse to the primary mechanism 
for language specification, the grammar or computer program. For this 
reason the system designer is under an obligation to make that mecha- 
nism available to interested investigators. Detailed and well-organized 
documentation is also essential if the functioning of a language pro- 
cessing system is to be understood. It is, of course, at best a 
difficult task to determine by hand how a particular sentence is 
handled by a given grammar or program. It is much easier tc use a 
working computer program which incorporates an underlying grammar. A 
directed generation program can provide much help, and an associated 
parsing program is an even more convenient aid, although in general 
less efficient with respect to computer time. Such programs should be 
made available for extensive exploration of a system's capabilities, 
and, within reason, questions as to the acceptability of specific 
sentences should be answered by a system designer through recourse to 
his own computer if necessary. 

A final means of natural language system evaluation is possible 
when that systen has both a syntactic and a semantic component, i.e., 
when a complete system exists for determining the meaning of a given 
sentence and taking appropriate action. In this case operational tests 
can be carried out to determine whether users of a language system can 
effectively use that language to solve given tasks, e.g., question- 





answering or natural language programming tasks. There are, of course, 
many variables to be considered in conducting such tests: educational 
background and system-specific training of che user, difficulty of the 
assigned problem, quality and difficulty of the solution, for example. 
Nevertheless, without such attempts at system evaluation it will be 
impossible to determine whether a given system represents an improve- 
ment over its predecessors. Evaluation attempts would be less essential 
if the natural language systems were all used by significant numbers of 
people over reasonably long periods of time. Unfortunately, this is not 
the case. As far as is known, only REL (4), The Lunar Science Natural 
Language Information System (27), Proto-RELADES (15), and perhaps 
CONVERSE (11) have attempted user utilization of a question-answering 
system, and attempts at system evaluation have been meager. 

Status of Current Grammars and Systems 

The second session was devoted to a survey of individual efforts 
aimed at producing "computer-oriented" grammars. These were discussed 
in groups according to the taxonomy previously mentioned. A large 
number of computer-oriented grannia;.* developers were in attendance and 
hence able to summarize the current state of their own projects. Other 
efforts were also noted and commented upon by the person or persons most 
familiar with them. 

One ground rule suggested in session two was that only those 
grammars be discussed for which some computer implementation of sentence 
generation or parsing has been written and is still operational. This 
rule was not always adhered to, because the time at which a program 
"expires" is usually somewhat vague as is the time at which a large 
program under development becomes operational. However, for the most 
part, emphasis was placed on current efforts rather than upon projects 
which are no longer actively pursued. 

Also, no attempt will be made in this report to exhaustively 
survey all of the recent efforts in the development of computer-oriented 
grammars, although such an undertaking was the intent of session two* 
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Because of the limitation of available time, this report cites only the 
efforts which came up for discussion and assessment of their present 
status . 

Beginning the discussion with context-free and equivalent grammars, 
it was observed that work on the Harvard Predictive Analyzer program (12) 
has been terminated and that the system is no longer maintained. The 
grammar is, of course, preserved in written form, and it lives on In a 
PL/I version written by Damerau (3). In sheer coverage of English and 
in the quality of its documentation it has few if any equals, but the 
inadequacy of structural descriptions it assigns to sentences limits 
its utility. 

Kay's work on parsing systems for rewriting systems was next 
reported upon because it specializes in context-free and context-sensi- 
tive versions in addition to providing parsing for certain general 
rewriting system grammars. The current parser is called the "Chart 
Parser" and it was described as a second generation extension of 
earlier work (10). Current descriptions of the Chart Parser and 
specific grammars written for it are not yet available, but they are 
being utilized in the MIND System (8), and documentation in conjunction 
with that project is in preparation. 

Kcplan described a general facility based on the Chart Parser 
which he calls PARSER or SYNTAX MACHINE and which can be specialized 
to accomplish a variety of functions. It was claimed that these 
included not only rewriting system parsing but also augmented transi- 
tion network parsing and transformational generation and parsing. A 
paper on this work is to be included in the proceedings of the NYU 
Symposium on Natural Language Processing (9). 

Very little was said about the CONVERSE System (11) in the absence 
of anyone who has worked on it. Kay reported that his Chart Parser 
has replaced earlier syntactic components utilized by that system. 

REL (4) was described by Dostert as a total software system, a 
subset of English being only one of the user languages available under 
it. Others include algebraic , music, and film languages. The entire 
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system (syntax, semantics, data base representation and accessing, etc.) 
is currently being transferred from an IBM Model 50 assembly language 
implementation to a GE 135 assembly language version. 

A good'^sized grammar is being written at the Linguistics Research 
Center of the University of Texas in conjunction with a German to English 
translation project (21). The syntactic component used in this work 
consists of a sequence of context-free grammars, each augpnented by 
agreement considerations with respect to features associated with the 
nodes. Translation from the structure assigned by one of these grammars 
to a. string that constitutes the input to be parsed with respect to the 
next grammar is accomplished by a translation mechanism similar to that 
described by Lewis and Stearns (13). 

Winograd reported on the current status of his PROGRAMMAR system 
for the syntactic and semantic specification of a language (24). The 
program was reported to be written in a particular version of LISP and 
work is underway on producing a clearer and cleaner version* Some 
versions have already been run on CDC and IBM LISP systems, Althou^ 
the system's syntactic and semantic processing are intertwined, the 
parsing process is scmewhat independent of semantic processing; the 
first syntactic analysis which meets semantic criteria is taken, and 
statement of backup paths to be followed must be explicit. This appar- 
ently complicates the task of extending a FROGRAMMAR-specified language. 

Because of its deterministic formulation, Winograd 's syntactic 
component differs from that of Woods' augmented transition networks. 
The latter provide a nondeterministic continuation mechanism. Also, 
they are grammar-specified rather than procedure-specified, as is the 
case for PROGRAMMAR. In spite of these differences. Woods' and 
Winograd's syntactic components are very closely related. 

Augmented transition networks provide a clean line between syntax 
and semantics. A single syntactic component has been used by Woods in 
conjunction with two different data bases (27). Semantic interpretation 
with respect to those data bases is carried out by the same procedure 
which was used in the Airline Guide application (25). The syntactic 



-28- 




component Is, at the moment, better developed than the semantic com- 
ponent. Not all syntactic structures are currently translated. The 
data bases in question are designed to support defense document re- 
trieval requests and moon rock data query. Results of the former 
application show that 75% to 80% of the sentences in a corpus con- 
tributed by users familiav with the application were parsed after 
necessary lexical doctoring. Of the sentences accepted, the right 
parsing was described as often not found, but a "reasonable" parsing 
was reported to be found 507o to 60% of the time. Implementation of 
Woods' question answering systems has utilized che Bolt, Beranek and 
Newman (BBN) LISP System. 

Robert Simmons reported the use of a system which he described 
as "a variant of Woods'", hence presumably also formulated as an 
augmented transition network. Deep structures are similar to those 
of Fillmore's case grammars. A second grammar is used to map those 
deep structures to paraphrases of the original sentences. A CDC LISP 
System is used as the program's means of implementation. 

The NYU String Program (18) is consistent with Joshi's string- 
based transformational grammars. The program is based on two sequential 

steps string decomposition and transformational decomposition of 

analyzed strings. Work is still in progress on the latter step. The 
existing program and underlying granroar have been used in analyzing 
sentences taken from English scientific articles. It was reported 
that a "correct" parse was included among the first three obtained 
about 657o of the time. An option is provided for collecting into a 
single structural description distinct structures which differ only 
with respect to the assignment of a modifier to a particular host. 
Resolution of these ambiguities requires further development of 
selectional restrictions. 

Joshi referred to work on imposing conditions on a transforma- 
tional grammar that insure generation of a recursive language and also 
to work on relating string-based and tree-based transformational 
grammars (7). An algorithm for syntactic analysis with respect to the 
former class of grammars has been developed, and a particular large- 




scale gramnar of English belonging to this class Is being written by 
Danuta Hlz* Only a small part of the grammar has been tested so far, 
and documentation of the grammar Is not expected to be completed until 
May 1973* It has not yet been decided whether to make use of the NYU 
String Program In achieving a capablllry for the syntactic analysis of 
sentences with respect to this grammar or. Instead, to produce a 
completely Independent Implementation, 

Work on a tree-based class of transformational grammars Is being 
conducted at the IBM Research Center, The class of grammars In question 
Is an extension of the class previously allowed by Petrlck (16), and 
provides for: binary features to be associated with terminal and non- 
terminal nodes, feature-sensitive and feature-manipulative transforma- 
tional rules, a general labelled bracket condition of transformational 
applicability, the expression of supplementary conditions of transforma- 
tional applicability, and an extended mechanism for specifying desired 
structural changes. An IBM 360 LISP System implementation is in 
operation, and a grammar is under development for a question answering 
system application (17). This grammar currently has nine base compo- 
nent context-free (CF/ phrase structure rules, 134 surface structure CF 
rules, 170 derived constituent structure CF rules, and 37 rather complex 
transformations. An early version of the grammar was tested using 
Friedman's transformational grammar tester program (6). 

Friedman's program has, in fact been used in developing and testing 
a nimiber of trans foirmational grammars. There are a number of reasons 
for this: the program is written in FORTRAN and is relatively exportable, 
it is well documented, and it was given a degree of generality in both 
transformational expressability and cyclic control which permits special- 
ization to different theoretical models. 

Two past effortct carried out at the IBM Cambridge Scientific Center 
were discussed by Moyne. These are the Proto-RELADES System (15) and 
the CUE System (14). The former, designed for use in a library catalog 
question answering system, utilizes a syntactic analysis specified 
gramniar, i.e., a surface structure context-free grammar and a set of 
transformations to map the surface structures thus specified into 




corresponding deep structures* Hence, this linguistic model Is seen to 
be rather different: from the usual generative transformational model* 
The CUE System features more extensive syntactic facilities but no 
semantic Interpretation with respect to a data base* In this syntac- 
tic model context-free parsing and analysis-specified transformation 
are Interspersed rather than separated as In the Proto-RELADES System, 

Three distinct components are used sequentially in Grenoble 
University's parser, which is part of a Russian to French automatic 
translation program (23)* The first component Is devoted to morpho- 
logical analysis* It uses a finite state transducer to map strings 
to strings. The second component uses a CF grammar augmented with 
context-sensitive features, and a bottom-up parsing algorithm to map 
the output strings of the first component Into surface structure trees* 
The third component utilizes a tree to tree transducer having the power 
of a Turing machine to convert surface structures Into "tree-formulas" 
which represent the elementary statements of a sentence and their 
logical connections* These, In turn, provide the basis for transla- 
tion to the target language* The system has beer tested on a corpus 
of about 400,000 words, divided Into texts of about 3,000 words each* 

The University of Montreal Incorporates another type of parsing 
system In Its English to French translation program* The Q-System (2), 
a rewriting system related to that of the MIND System, Is used to 
separately provide models for morphological and syntactic analysis* 
These are sequentially applied In obtaining a structure which Is the 
basis for English to French translation* Testing of the system on 
random texts taken from colloquial conversations and from the field 
of economics Is currently In progress* 

A number of other efforts related to computer^ oriented grammars 
were either discussed very briefly or !u<%rely mentioned. These are 
represented by appropriate references* No attempt has been made to 
Include In the references all relevant papers; Instead, only the most 
recent papers describing a particular line of research are listed and 
the Interested reader can consult them for references tc past work* 
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Parsing 



The third session focused on parsing. Although an attempt was 
made to elicit information on computational time requirements, the 
incommensurability of the systems in question all but precluded 
meaningful comparison. Even more Important than the characteristics 
of the different computers used are the differences In the syntactic 
coverage of languages employed as well as the differences with respect 
to the adequacy of structural descriptions assigned. A time figure 
la not really meaningful without pinning down the sentence, grammar, 
computer, and parsing algorithm. Times cited for a fifteen to twenty 
word sentence ranged from ten to as much as 130 seconds . 

A few other topics related to parsing were also brought up. The 
first relates to the question of the relative efficiency of inter- 
spersing syntactic analysis and semantic interpretation as compared 
to sequentially performing those functions. Clearly, it can be no 
less efficient to allow (but not require) intersperslon of these two 
components. It is not cle^r, however. Just how much semantic inter* 
pretation prior to completion of parsing is optimal. This depends, of 
course, upon the particular syntactic analysis and semantic interpre- 
tation components in question* It is interesting to note, however, 
that for two systems reporting experimentation designed to determine 
the cvptlmal amount of semantic interpretation to be carried out prior 
to finishing syntactic analysis, the answer turned out to be none. 
That is, it was found to be more efficient to delay semantic inter- 
pretation until completion of syntactic analysis. Evidently, the time 
lost in semantically Interpreting structures which would later have 
been precluded for syntactic reasons was greater than the time gained 
by blocking semantically anomalous structures that would only later 
have been blocked for syntactic reasons. The two projects in question 
were the REL effort (4) and the BBN Lunar grammar (27). Results for 
the former are not so surprising because of the relatively high cost 
of semantic interpretation due to the use of secondary storage for 
data base representation; however, the BBN Lunar grammar used a single 
level of storage. 
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Although a few new topics related to parsing were Identified, 
very little In the way of results was reported. These Included the 
following: (1) "errorful" parsing. I.e., parsing strings generated 
by a given grammar plus an additional noise component; (2) parsing 
algorithms that proceed from given anchor points ?n a sentence 
rather than proceeding strictly lef t-to-rlght or rlgh t-to-left ; 

(3) parsing v/lth respect to phonological component grammars, and 

(4) natural language subset extension. I.e., Incremental modification 
of a grammar both directly and through statements In the language of 
that grammar. 

To describe the current situation with respect to developing 
large-scale grammars and syntactic analyzers, one can say that this 
activity Is certainly not currently being undertaken by many linguists, 
whose typical practice consists of examining isolated but hopefully 
key linguistic phenomena and proposing modifications to linguistic 
theory on the basis of that examination. Putting together a large 
set of rules (often, any explicit rules) is very unfashionable, as 
is the detailed and complete specification of a proposed class of 
grammars (i.e., a linguistic theory). Nevertheless, there is some 
ongoing research in these areas, and one could hazard the identifica- 
tion of three distinct types of models being used. The first is some 
kind of a phrase structure grammar, often a context-free grammar 
frequently augmented by binary features and even by a set of "trans- 
formations** which map surface structure into a *'dceper*' representa- 
tion. This first model is not based on an underlying transformational 
grammar . 

A second class of models consists of generative transformational 
grammars. There have been more rule testing systems than parsing 
systems developed for these grammars. The parsing systems that have 
been developed formally resemble to some extent the **trans formation- 
ally** augmented variant of the first class of models; they are, how- 
ever, usually slower because of the necessity that certain nondeter- 
minis tic continuations be followed. 
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The third class of grammars Includes such efforts as those of 
Woods (26), Winograd (24), Bobrow and Fraser (1), and Thome, Bratley, 
and Dewar (22): augmented transition networks and their related 
efforts. Proponents of this class of models hold forth the hope 
that their devices will prove useful not only as a primary means of 
linguistic description but also as a useful formalism for specifying 
parsers of the other two classes of grammars. They have, indeed, 
been used to construct a parser for a given context-free grammar; 
however, more general results, in particular the construction of a 
parser for a given transformational grammar, have not been obtained. 

One obs3rvation on these three classes of grammars concerns the 
types of structures they assign to sentences. They all are capable of 
assigning tree structure (with associated features) to sentences, and, 
as used by their proponents, they all refer to deep and to surface 
representations. Not long ago the view was commonly held that although 
there were considerable differences between the models in use in 
computational linguistics, the differences between the structures they 
assigned to the same sentences were not so great. This was, in any 
case, the consensus expressed at the Ames Seminar on Text Processing 
Research of January 1970. It seems less true today, however. We 
have differences between structures assigned by case grammars and non- 
case grammars, by generative semantic and interpretive grammars, etc. 
Indeed, the terms deep and surface structure are relative rather than 
absolute, and the term deep structure in particular may refer to very 
different types of structures in different linguistic theories. 

A final question about the grammars identified in this report 
concerns the degree of their capabilities for specifying subsets of 
a natural language and assigning acceptable structures to the sentences 
specified. This question was raised during both sessions, and, although 
it could not be answered clearly, a number of suggestions were offered 
to grammar writers and parsing system implementors . If followed, 
these suggestions would probably make an evaluation of those capabili- 
ties possible. 
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MACHINES AND SPEECH 
Franklin S. Cooper, Chairman 



Introduction 

Speech, as a topic for the Conference on Research Trends In 
Computational Linguistics, was Included In the agenda because of Its 
emerging relevance to the field. This factor set the tone for the 
opening discussions within the group and was largely responsible for 
the fact that most of the group's attention v;as spent on the two 
topics for which the overlap with the subject matter of conventional 
computational linguistics was most clearly evident. There was tacit 
agreement that many of the problems that concern speech researchers 
and for which they regularly use computers would not be of Interest 
to most of the conferees. Thus, there was little discussion of such 
current speech areas as the physiology of speech production and 
audition, perception of speech and speech-like stimuli, cross -linguis- 
tic studies of speech and universal phonological theory, children's 
language acquisition, speech and hearing disabilities, etc. 

Instead, the discussion centered on speech production by computers 
In the context of reading machines for the blind and on speech recog- 
nition by computers as a central problem In designing speech under- 
standing systems. There was In addition some discussion of the research 
tools needed both for research In depth at major research centers and 
for graduate level training In a larger number of university laborator- 
ies. The Chairman, In his report to the plenary session, dealt only 
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briefly with the state of the art in speech research but put primary 
emphasis on research opportunities, covering some areas in addition 
to those which had received attention in the group discussions. 

Speech as a Part of Computational Linguistics 

Computational linguistics, as defined by past usage, has dealt 
mostly with written rather than spoken language. This is mainly an 
historical accident; nevertheless, the time has come to examine the 
areas in which speech may now be considered a part of thi? fields 
There are, indeed, new factors to be considered: one is that the 
domain of automated language processing is, in practice, being ex- 
tended to include systems that generate speech as an output and that 
accept it as an input. Another reason, based in theory, is a growing 
awareness of parallels between low level processes of speech produc- 
tion and perception and higher level processes involving syntax and 
semantics. This has come about mainly as a consequence of psycho- 
linguistic experimentation on human language processing at all levels, 

although most of the effort and the more penetrating methodologies 

have been applied at the level of speech. Thus, research on human 
processing of language, speech included, can serve to suggest areas 
and methods that computational linguists may wish to explore. 

This is not to say that all of speiech research ought to be co- 
opted into computational linguistics: much that is merely descriptive 
or that is concerned with the technology of voice communication has 
little to offer or to gain in return. A reasonable criterion might be 
that there should exist a mutual interdependence between the speech 
processes and the higher level processes involved in the same overall 
automated language operation. This, at least, was the basis we used 
in choosing topics for discussion, and in considering practical appli- 
cations. The nature of the interdependence will become evident in the 
following discussions of reading machines and of speech understanding 
systems . 
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Speech as an Output of Automated Language Processing: Reading 
Machines for the Blind 



Most of the familiar instances of speech from machines telephone 

directory assistance, airline announcements, etc. are essentially 

uninteresting to computational linguists. They involve such limited 
vocabularies and fully defined syntactic structures that adequate 
speech can be had simply by using prerecorded words and phrases or 
their synthetic equivalents. The more general problem of generating 
speech from unrestricted text is only now approaching solution, primar- 
ily in connection with reading services for the blind. Indeed, many of 
the problems that face any automated speech output device of a non- triv- 
ial design can be described and analyzed by detailed consideration of 
this single application. The practical problem of providing such a 
service has many additional aspects, nonlinguistic as well as linguistic. 
Here we shall undertake no more than a sketch of component operations 
and problem areas. 

The primary function of a reading machine for the blind* (so called 
because of the many attempts to build simple, portable devices) is to 
provide blind people with adequate access to the full range of printed 
materials that sighted people read. An approach that has been tried 
repeatedly during the past sixty years is to use a photoelectric probe 
that converts letter shapes into sound shapes which the blind user is 
expected to learn. This task, it is argued ^ is no worse than learning 
a foreign language; indeed, it should be easier, since the lexicon and 
syntax are those of English. In practice, learning is laborious and 

reading rates are disappointingly low comparable to Morse Code and 

roughly an order of magnitude less than listening to spoken language. 
The reason for the superior performance of spoken language is by now 



*rwo recent reviews that deal with the general problems of sensory aids 
for the blind (including reading machines) and that give extensive 
references to the literature are ''Electronic Aids for the Severely 
Visually Handicapped" by Jonathan Allen in Critical Reviews in 
Bioenglneering , D. Fleming (Ed.), Chemical Rubber Co., 1971, pp. 139- 
167, and "Sensory Aids for the Blind: A Challenging Problem with 
Lessons for the Future" by P.W. Nye and J.C. Bliss, Proc. IEEE , 58, 
1878-1898 (1970). 




quite clear: speech is a highly condensed encoding of the message, 
whereas the letter sounds provide only a serial encipherment into 
acoustic form and so are far less efficient carriers of linguistic 
information. 

Both theory and experience suggest, therefore, that high perfor- 
mance by a reading machine requires that it be able to speak plain 

English or wliatever language is being used. The complexity and size 

of such a machine are such that it will have to be, for the present at 
least, a central facility that records tapes on request, or possibly 
serves a remote user who is linked to it by telephone. Many of the 
practical problems are therefore those inherent in the organization, 
financing, and administration of any sizeable service function such, 
for example, as a time-sharing computer system. 

In the operation of a reading service center, the first step is 
to obtain a machine-readable alphanumeric tape from the printed text 
that was requested by the blind user. Sometimes compositor's tapes 
(from which the book was printed) will be available, but usually the 
printed page must be converted by optical character recognition (OCR) 
machines, or by manual keyboarding. [This problem of entering data 
from the printed page is shared with many projects in computational 
linguistics. At the present time, there are no suitable OCR machines 
available as standard equipment, and only a few companies are prepared 
to supply special machines capable of reading proportionally-spaced 
characters in multiple fonts. Service bureau facilities for reading 
conventional printed text are likewise limited and expensive. The 
reason for this situation lies only partly in the technical difficulties; 
mainly, th.4re is less customer demand for such devices than for simpler 
and cheaper machines designed to read at high rates the specialized, 
uniformly-spaced characters from credit cards and business documents. 
It may well be that the needs of computational linguists, as well as 
those concerned with reading centers for the blind, can cnly be 'met 
through a development project aimed at their special needs, i.e., for 
moderate accuracy, moderate speed, and reasonable cost in d machine 
with enough virtuosity to recognize the commonly used fonts and to scan 




bound volumes. Whether or not such a project falls within the scope 
of computational linguistics remains open to question. It could 
nevertheless be so useful for text Input to computers that a good case 
can be made for It.] 

The next step, once the text Is available to the computer In 
alphanumeric code. Is to arrive at the pronunciation of each word In 
terms of some appropriate phonetic notation. Due to the nature of 
English orthography, no simple set of rules can be employed to derive 
an acceptable guess at the proper transcription for all English words, 
though there are spelling regularities that may, perhaps, be exploited 
to advantage. Hence, some kind of pronouncing dictionary (In machine- 
readable form) Is essential. Two general approaches are being trlid: 
(1) There Is the straight-forward, pragmatic one of storing the phonetic 
equivalents for every word of a large lexicon. Including separate entries 
for most root-plus-af f Ix combinations. This allows the specification of 
Inherent lexical stress, as well as of a code Indicating the usual 
syntactic role of the word. A dictionary of this kind with approximately 
150,000 entries, about equivalent to a desk-type dictionary. Is easily 
accciranodated on a single IBM 1316 disk pack. (2) With a more analytic 
approach, considerable savings In dictionary size, perhaps ten-to-cile 
for very large dictionaries, may be achieved by attempting to break the 
inpu*": words Into their constituent morphs . However, word pronunciation 
is not a simple function of the pronunciation of its constituent parts: 
for example, a suffix may shift the placement of primary stress and 
force a change in vowel quality. In any case, the dictionary must 
contain phonetic equivalents for the full set of morphs and for a 
substantial number of frequent words that violate spelllng-to-pronuncla- 
tion rules. Research problems of considerable interest are Involved in 
such an approach: for example, the development of a set of pronunciation 
rules based on an underlying representation of English morphemes, working 
out algorithms for word decomposition, finding rules for the placement 
of lexical stress in words that are reconstituted fvom their morphs, and 
inferring syntactic roles that such words can play. 

Once a canonical phonetic representation has been obtained for each 
word in the text, it is necessary to employ a set of phonological rules 
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which determine how the sequence of words which constitute an inte- 
grated sentence should be spoken. Perceptual experiments with 
concatenated words, using recordings of single words spoken in iso- 
lation, have demonstrated the importance and the extent of the 
sentence-level recoding of spoken word strings. As an example at 
the segmental level, a rule transforms the phonetic sequence [d, 
word boundary, yl into [jl^ which means that the normal segmental 
realization of the sentence 't)id you go?'* is actually [dl5ugo] . 
\^ile effects of this sort are familiar, the exact form which the 
rules take is not known and the inventory of such segmental rules 
is far from complete. Finding these rules is a challenging research 
problem and one to which computational methods can make an important 
contribution, especially in testing the reliability and range of 
application of proposed rules. 

Of more importance than such segmental rules are phonological 
rules that determine the temporal organization of the acoustic output 
and the fundamental frequency of vocal cord vibration as a function of 
time. Segmental durations and intonation contours are influenced by 
a number of factors, especially the syntactic structure and stress 
pattern of the sentence. Indeed, some sort of syntactic analysis is 
essential for the synthesis of a satisfactory, spoken sentence. In 
some degree, structure can be inferred from orthographic punctuation, 
and rules based on punctuation are sometimes sufficient. Nevertheless, 
better methods are needed. This dependence of speech quality on 
syntactic structure is perhaps the major area of overlap between the 
reading machine problem and conventional computational linguistics, 
and a promising target for further research. 

At this point in the process of generating speech from written 
text, the computer has assembled a phonetic string that has had 
appropriate allophonlc and stress adjustments and that has been 
marked for intonation and juncture. It remains to convert this 
phonetic description into control signals that will operate a hardware 
synthesizer, or its simulation in software. Two general methods of 
speech synthesis by rule are currently utilized: (1) For a termlnal- 
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analog synthesizer, the rules manipulate acoustic variables directly, 
e.g., sound source type and pitch, formant frequencies, and Intensities. 
In this case, the rules for synthesis begin by consulting tables for 
the canonical form of each phone, then computing the necessary context- 
ual adjustments (corresponding approximately to coartlculatlon in 
human speech). Typically, there are about a dozen parameters that 
are used to control a hardware synthesizer; these are specified at 
regular Intervals of about ten milliseconds , requiring a total output 
bit rate of about 4800 per seconds (2) For an artlculatory-analog 
synthesizer, the rules typically manipulate artlculatory variables 
such as the positions and shapes of simplified models of tongue, lips, 
velum, larynx, etc. The resulting shapes (of the model vocal tract) 
are then used to compute an acoustic output, or the control signals 
for terminal-analog hardware. (It should be pointed out that substan- 
tially more work has been done on terminal-analog models and rules 
which currently provide a more intelligible output than do artlculatory 
Implementations; however, artlculatory models are Improving rapidly and 
are of greater theoretical Interest because rules of coart Icula tion may 
ultimately be built in as automatic constraints. In addition, modelling 
of this kind may lead to a better understanding of the physiology of 
speech production; Indeed, this is an area of promise for future re- 
search to which the present discussion will return. 

The intelligibility of the synthetic speech currently produced by 
rule with terminal-analog systems is surprisingly good. Tests of conso- 
nant-vowel words and nonsense syllables synthesized by rule have shown 
that listeners are able to Identify them correctly 95% of the time. 
Systematic testing for longer words and sentences has not been done, but 
interviews and Informal tests have been carried out with blind students 
who have listened to chapter-length recordings of textbook materials. 
Pew words are missed and overall comprehension is high. Nevertheless, 
it is clear that the treatment of consonant clusters and of stress and 
phrasing needs Improvement and often seems unnatural. Future improve- 
ments in the rules will depend on careful analyses of natural speech 
and the systematic manipulation of synthetic speech to test the percep- 
tual relevance of proposed changes In the rules. 
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The evaluation of synthetic speech in terms of its real-life 
utility to blind individuals is a major task. The speech itself needs 
to be considered along two dimensions — intelligibility and naturalness . 
The voice quality of current synthesizers is not as natural as would be 
desirable, despite efforts to improve it. This may imply that we are 
ignoring some critical variables such as glottal source irregularities 
or, alternatively, that naturalness will not be achieved until the 
synthesis rules are improved sufficiently to avoid all of the conflicts 
between acoustic cues and message content. It is interesting to note 

and ultimately encouraging — that it is the rules in their present 

form and not the synthesis hardware that is at fault; in fact, extremely 

good synthetic speech indistinguishable from the spoken version can 

be made by meticulous hand adjustment of the control parameters. 

But complete naturalness may be too much to expect for speech that 
is synthesized by fully automatic methods. Evaluation must then deal 
with the question of how useful the product is for, say, the blind 
student in preparing his lesson assignments. Its principal advantage 
over natural speech is that he should be able to get the material he 
wants when he needs it, not some weeks or months later as often happens 
with recordings by volunteers. The computer can read tirelessly and 
faster than a human, though its actual performance will depend on how 
well the service function is organized. Good intelligibility of the 
sjmthetic speech is, of course, essential but this may not be an adequate 
criterion. It might be that listening to synthetic speech imposes so 
mi?ch perceptual load that comprehending and remembering the content would 
be excessively difficult; hence, comprehension tests and measures of 
fatigue are more likely to be relevant than intelligibility tests in 
evaluating the practical usefulness of computer-generated speech. Eval- 
uations of this kind are being started, in parallel with efforts to make 
the synthetic speech sound more natural. 

This sketch of the reading machine problem has pointed to some of 
the areas of interdependence between speech research and computational 
linguistics. Thus, many problems of dictionary management are shared. 
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Spelllng-to^sound rules and algorithms for decomposing words into 
constituent niorphs would reduce the size of the dictionary needed for 
a reading machine, just as comparable algorithms for syllabification 
do for automated typesetting. Likewise, reliable methods for recon- 
stituting words and for inferring their usage would be useful to 
either a machine that must read them aloud or to one that is composing 
written responses. But the common ground is most evident at the level 
of syntactic analysis. An efficient general purpose parser is almost 
equally necessary for properly rendering a sentence into spoken form, 
or for inferring its content from its written form. For the present, 
reading machines must depend on explicit punctuation and a pseudo 
parse of some kind; perhaps, short-cut methods that yield good speech 
would also have practical application to the automatic punctuation of 
synthetic written responses. The existence of this common ground 
implies research opportunities on the interrelations of spoken and 
written language---another topic for later discussion. But before 
dealing with areas for research and practical application, the report 
will give an account of the discussions on speech understanding systems. 

Speech as an Input to Automated Language Processing; Speech Under- 
standing Systems 

Most of the discussions in this session centered on Advanced 
Research Projects Agency (ARPA) -sponsored projects on speech under- 
standing systems, underway for about six months. Five major research 
groups, already engaged in other ARPA -supported work, are involved. As 
an initial step, a study group assembled from these projects analyzed 
the problems and prepared a set of objectives, specifications, and 
plans.* 

The overall objective is to develop one or more demonstration 
systems, primarily to show that the technology now exists though it 



*"Speech Understanding Systems: Final Report of a Study Group". Pub- 
lished by Computer Science Department, Carnegie -Me lion University, 
Pittsburgh, Pa., May 1971. 
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Is scattered — to make such an undertaking feasible. Each of the 
major research groups Is undertaking a task of Its own choosing, but 
with the expectation that a cooperative effort on some one of these 
projects, or an entirely different one, will emerge. Definition of 
the tasks was considered to be a crucial point In setting the level 
of difficulty and the chances for success. Indeed, the long back- 
ground of failures to solve the ''speech recognition problem" was In 
part responsible for the ARPA decision to undei*take a related, but 
more manageable, program. 

The machine recognition of speech has been a persistent challenge 
for at least the past twenty years. Several ways have been found to 
recognize spoken digits, or even a few dozen words when they are spoken 
singly and carefully. But the general problem, usually put In terms of 
a phonetic typewriter or a system for automatic dictation, has remained 
elusive. The much larger vocabulary that Is required, the necessity of 
dealing with connected speech, and the need to accommodate a number of . 
different speakers have all posed severe difficulties. Moreover, as 
more has been learned about the nature of the speech signal and Its 
understanding by humans, the clearer has become the magnitude and com-* 
plexlty of the recognition problem. It should be noted that most attempts 
thus far to deal with the general problem have used a **bottom-up'' approach, 
I.e., one In which phonetic elements, words, and sentences, are all found 
by successive analyses based on the acoustic signal. The speech under* 
standing systems projected by the ARPA Study Group differ In two Important 
ways: constrained objectives make the problem more manageable^ and 
reliance Is not placed on bottom-up analyses. 

Speech understanding. In the present context, means essentially that 
a computer, when told to do something, will "understand'' well enough to 
take the correct action or will ask for clarification. If, for example, 
the computer has control of a robot, then a command to put the red block 
In the box can lead to an easily confirmed performance, provided there 
Is a red block, and a box that Is larger than the block; otherwise. It 
should lead to a question or an error message. Or again, If the computer 
contains a file of Information about moon rocks, It should be able to 
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answer Inquiries about the numbers, sizes and chemical compositions of 
those rocks, whether the query is phrased as a question or as a demand 
for data. Obviously, tasks of this kind are multidimensional and can 
be constrained in ways that will, in fact, determine their difficulty. 

The attempt has been to choose tasks more or less like the two 

mentioned that are constrained in such a way as to make them manage- 
able, but not to the point of making them trivial. A practical payoff 
was not considered a mandatory requirement. 

Performance in the task situation not only limits the number of 
possible responses to the voice input. It also provides additional bases 
for analyzing it. Thus, limitations on the vocabulary, on the syntax 
that Is permitted for the task, on allowable operations to be performed, 
and predictions of the probable behavior of the person speaking — all 
provide bases for making hypotheses about the actual message carried by 
the incoming acoustic signal. Indeed, It is on such lexical, syntactic, 
semantic, and pragmatic "support" that the research groups are putting 
their hopes and focusing their efforts; although the acoustic signal Is 
not being Ignored > it Is receiving far less attention than it has In past 
efforts to solve the general recognition problem — quite possibly, less 
attention than it will need to receive If effective use Is to be made of 
"top down" or "side In" approaches. 

The ARPA Study Group's final report discusses at length the kind 
of support, and the nature of the problems, to be expected at the 
various levels. Interdependence across levels Is a characteristic of 
the entire undertaking, and there Is much overlap with the domain of 
conventional computational linguistics. There are challenging differ- 
ences, too: neither sentences nor words are well formed, as one would 
expect them to be In written text; moreover, the need for live inter- 
action between user and computer means that the computations must be 
done in real time or less, much of It while the sentence Is still being 
spoken • 

Only at the very lowest levels, where parameters are being extracted 
from the acoustic signal and are used to guess the phoneme string, are 
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the problems wholly within the domain of speech research. Here, al* 
though engineering problems of pitch derivation and formant tracking 
are not trivial, the major difficulty lies in inferring the phonetic 
string — a difficulty that may be inherent in the nature of speech 
itself. It is clear that in the production of speech there is much 
restructuring (or encoding) of the segmental units to achieve a 
compact, smoothly flowing output; that is, there is much overlap of 
the gestures for units that are themselves sequential. It is, indeed, 
the rules for this coar ticulation that provide a basis for speech 
synthesis by rule. But the rules we use are generative rules; the 
inverse rules, to the extent they exist, are largely unknown except 
for those phones and contexts where coartlculation is minimal and 
the **code*' can be said to be transparent. A general paradigm, pro- 
posed by Stevens some years ago and labelled "analysis*-by-synthesis", 
uses heuristics to guess the phonetic string and then confirms by 
synthesis, or uses error signals to make a better guess. An alterna*- 
tive, referred to by the Study Group as *'hypothesize-and-test*', looks 
for transparent places in the code (and for other acoustic landmarks) 
to generate a much less complete hypothesis about the phonetic string, 
and then brings to bear information from higher level processes. An 
example would be to look for a word that begins with strong, high 
frequency noise, then guess the word to be **six** if it has one syllable, 
or ''seven" if two, provided one has other reasons to expect a spoken 
digit. 

Obviously, there are significant research problems at the speech 
level, as vTell as practical difficulties, for a speech understanding 
system: can a set of rules for analysis be devised? or even a partial 
set that would have practical utility? What kinds of heuristics will 
be most helpful if one resorts to analysis-by-synthesis? What relative 
reliance should be placed on extracting as much information as possible 
from the acoustic signal versus having to depend on support from higher 
linguistic levels? Underlying these questions is the assumption that 
present knowledge of the acoustic cues is reasonably complete; in fact, 
much remains to be learned about the cues that operate in consonant 
clusters and in connected speech, both careful and casual. 




At the next higher level, the principal problem is how to convert. 

the string of phonetic elements perhaps Incomplete, and certainly 

error-prone — Into a string of words, which may also have intervening 
gaps. Word boundaries are not at all evident at the acoustic level, 
ar«d so do not appear in the phonetic string. Segmentation into lexical 
elements must then proceed mainly by matching to strings that corres- 
pond to words In the task vocabulary. Often the low-level analysis 
will have generated more than one possible candidate for each phone, 
omitted an element, or supplied a wrong entry. The matching operation 
must somehow avoid the combinatorial explosion that could readily 
occur if there were several options for each of a string of phones. 
Omissions and errors pose obvious additional difficulties. There is, 
therefore, serious need for support from higher levels as well as 
efficient analysis at the phonetic level. 

The construction of sentences from a partial and "noisy" string 
of words can draw support from whatever restrictions the task may im- 
pose on the synuax, or from information about the location of syntac- 
tic boundaries that can be Inferred from the suprasegmentals found in 
the acoustic signal. The latter relationship is essentially the 
Inverse of the dependence of synthesis on syntactic Information to 
assign stress and intonation and thereby generate spee^rh that sounds 
natural. There are other problems at the sentence level that have 
their counterparts in parsing written text. They differ, though. In 
that analysis of the spoken "sentence" will often have to deal with 
intrusive hesitation sounds, with non-well -formed or Incomplete sen- 
tences, and with both backwards and forwards parsing from a starting 
point somewhere within the utterance. There are obvious research 
problems of considerable importance In devising methods for parsing 
under these conditions. 

Interpretation of the sentence as a basis for action can rely 
only In part on the output of the syntactic analysis, since that out- 
put Is likely to be faulty or ambiguous. Good use will need to be 
made of the semantic constraints Imposed by the task, and of pragmatic 
Information about the behavior to be expected of the human operator. 




Except for such help, the task of interpretation has all the complexi- 
ties inherent in the content analysis of written text. If the appro- 
priate response from the computer is an answer to a user's question, 
then various cognitive functions must also be performed and a sentence 
must be generated that is appropriate in both content and form* The 
response itself could be in either written or spoken form, with the 
latter making use of techniques for text-to-speech conversion developed 
for reading services for the blind. 

It will be evident that speech understanding systems are involved 
with language at all levels, and pose many problems that should interest 
computational linguists. The differences between these problems and 
more familiar ones reflect deep differences between oral and written 
language. Even so, this account has probably understressed the perva- 
sive influence of speech at all levels of the speech understanding 
problem, and not merely those levels to which the term is usually applied 

Research Areas With Speech Involvement 

The preceding discussions of speech outputs from computers (for 
reading machines) and speech inputs to computers Cin speech understanding 
systems) have exposed a number of important areas for research in compu- 
tational linguistics. Some of the basic problems that were mentioned in 
passing deserve additional consideration. 

Translating Between Written and Oral Language 

Alth •uf.h it may be overdramatizing the differences between written 
and oral language to speak of translating the one into the other, it may 
nevertheless suggest a useful point of view in reexamining some old 
problems and considering some new ones. We have seen that the research 
on reading machines for the blind makes explicit some written/oral 
differences that often pass unnoticed. Ideally, a reading machine should 
convey all of the useful Information that is on the printed page. This 
may include much more than the bare text we have so far considered, even 
without taking account of pictures and diagrams, a task far beyond 




present capabilities. The well printed page uses many typographic 
devices to organize and modify the literal text: punctuation marks 
are so coiranonly used that we think of them as a part of the text, 
though their realization In sound follows very different rules than 
those applied to the letter; capitalization, too, Is widely used, 
and for a variety of purposes; then there is the judicious use of 
different type fonts, sizes, weights, and leadings; and paragraph 
indentation Is only one of the devices used to indicate breaks, sub- 
ordinations, listings, etc. 

All of these carry valuable Information to the eye. How, and 
to what extent, can this Information be "translated" for easy use 
by the ear? Not, one would hope, by overt description of the typo- 
graphy. The commonest forms of punctuation have acoustic reflexes 
that are fairly simple and regular. Are there comparable acoustic 
signals for other graphic symbols? The Inverse transformations 
Involved In writing are no easier, though they are more familiar: 
thus, oral questions, statements and exclamations are indicated by 
their standard typographic signs; likewise, emphatic stress can be 
signalled by Italics. But how does one convey a note of Incredulity 
or petulance without resorting to bald description? We know that 
the skillful writer can do It; perhaps, then, ways can be found to 
convey typographic messages of comparable subtlety to the blind 
listener In a reasonably graceful way. As a practical matter, when 
one Is converting a compositor's tape Into speech, something must 
be done with each of the graphic signals that it contains. 

If compositors' tapes seem to carry too much information about 
graphic details, they can be accused equally of carrying too little 
of the essential information about the sentence. Where, for example, 
Is sentence stress to be placed during synthesis? Where are the 
breaks Into breath groups for a long phrase? And just how should 
the intonation be handled? But it is hardly correct to say that the 

Information or most of It is not provided; rather, it Is burled 

In the structure, which Is why good speech synthesis will be so 
dependent on adequate parsing. This assumes though that full know- 




ledge of structure is a sufficient condition for making speech that 
sounds natural. Even if true, much has yet to be done about finding 
the rules by which sentence structure can be converted into sentence 
suprasegmentals • Rules for the inverse transformation from supra- 

segmentals to structure to the extent that they exist — could be 

extremely helpful in supplementing the phonetic analysis in a speech 
understanding system. A related problem that dips more deeply into 
conventional speech research is the relationship between stress and 
intonation (as linguistic entities) and the relevant dimensions of 
the acoustic signal. These relationships are known only in part. 

The written sentence is, nevertheless, incomplete in at least 
some respects: witness sentences that are ambiguous in written form 
but not in spoken form, because the real-world context is missing 
in the one case but often is indicated (by stress and intonation) in 
the other. One can find ambiguity at the lexical level, too wit- 
ness homographs and homophones though the ambiguity in one mode of 

expression is usually resolved in the other. It might be interesting 
to explore the conditions for ambiguity wherever it occurs. 

These are a few of the translation problems that one would en- 
counter in dealing only with English. It is easy to see, from the 
different structures and orthographies of other languages that dif- 
ferent but probably not fewer — problems would arise with them. 

Finally, the general problem of working back and forth between oral 
and written versions of the same language will be one of increasing 
concern to computational linguists as automated language processing 
becomes more and more involved with speech as its input and output 
modes • 

Modelling Speech Production 

We have seen that the rules by which speech is synthesized 
necessarily deal with the higher level processes that are the normal 
domain of computational linguistics. Although the models of speech 
production to which those rules apply lie more nearly in the usual 
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realm of speech research, the models must operate from higher level 
control signals. Hence, the development of speech models and the 
organization of their control signals is an area of research that is 
relevant to computational linguistics as well as important in itself. 
Additional reasons are that the processes of speech production 
parallel those at higher levels in interesting ways, and that experi- 
mental methods for probing the lower level processes are well developed. 

The process of human speech production has several distinguishable 
subprocesses , organized hierarchically into levels. Parallels with the 
levels of linguistic processing are based mainly on operations that 
restructure the intended message to make it more compact and to put it 
into linear form, an obvious requirement for eventual output as a 
time-ordered acoustic signal. Speech has the additional feature that 
its processes change mechanisms on the way down; implementation is no 
longer done at all levels by neuromechanisms , but must include signal 
transformations in proceeding from nerves to muscles to gross movements 
and to sound generation. Thus, some of the restructuring, or encoding, 
that we find in the spoken message is a consequence of interface re- 
quirements. Linguists usually stop — perhaps with good reaaon — when 
they have specified the phonetic string (or its equivalent it; the form 
of a sequence of feature matrices), leaving actualization of the message 
to a human speaker. 

What remains to be performed — and to be modelled — are the succes- 
Ive conversions that lead eventually to speech: (1) The phonetic string 
must be grouped into pronounceable units and converted into a pattern 
of neural commands to muscles. This involves collapsing the string of 
linguistically discrete elements into an articulatory unit of about 
syllabic size and the temporal coordination of a substantial number of 
neural signals. Some of them may be crucial for maintaining phonemic 
distinctions (and some merely accessot'O* or all may be directed at 
achieving a particular target performance. The organization of these 
unit gestures is a topic of very lively interest in current speech 
research. (2) The pattern of neural impulses activates the qiusclea of 
articulation in a process that may be quite straightforward, or may 



involve ganmia-e f f erent feedback loops in important ways — another 
topic of current interest. (3) In either case, the gesture in 
neural form is converted, subject to muscular and mechanical restraints, 
into gross movements of the several articulators, and these in turn 
determine the configuration of the vocal tract and ics acoustic excita- 
tion. This involves encodings of quite a different kind from those 
mentioned above, but often quite extensive; they account at least for 
a mechanical component in coarticulation . (4) Finally, excitation and 
configuration determine uniquely---but not simply---the acoustic wave- 
form of speech • 

Efforts to model these processes have typically worked upstream 
from the acoustic level, usually dealing with a single conversion* 
Thus, the work of Fant and others has given us a good grasp of how to 
convert changing articulatory shapes and excitations into changing 
acoustic spectra* X-ray movies and sound spectrograms are only two of 
the experimental methods for exploring and testing these conversions* 
The relationships between muscle contractions and articulatory movements 
are under intensive investigation, using electromyography both to mea- 
sure muscle activity and to infer neural signals, and using x-rays and 
spectrograms to observe and infer the resulting movements * Efforts are 
being made to describe the organization of gestures in motor command 
terms, with verification to be provided by measurements on muscle activ- 
ity, configurations, and sound* 

Computer methods have been used to good effect in both the experi- 
mental work and in modelling conversions at the lower levels* They have 
been used also to good effect, but in quite different ways, in speech 
synthesis by rule* Thus, the terminal -analog type of synthesis by rule 
bypasses all the intermediate stages and operates directly from an input 
phonetic string to the output speech waveform* The articulatory type of 
synthesis by rule makes a lesser leap from phonetic string to articula- 
tory gestures, then uses level-by-level models to get to the acoustic 
output* The obvious goal is good modelling of the conversions at each 
level, confirmed by direct experimental measures wherever that is pos- 
sible, and also by the synthesis of natural-sounding speech when the 
models are use<?i in tandem* 



ERIC ' 62 



Interfacing Speech to Phonology 



It might appear from the preceding discussion that the processes 
of speech begin where those of linguistics usually end, I.e., with the 
message In the form of a phonetic string or the equivalent sequence of 
feature * matrices • However, the phones and features of the speech re- 
searcher are not or riot necessarily those of the linguist, since 

they are defined by different operations. It Is the resulting mis- 
match at this level that poses one of the major problems in modelling 
the total process of generating spoken language. 

When linguists use labels such as "compact" and "diffuse" for 
features, there Is no Implication that the features will convert 
directly Into components of a pattern of neural commands to muscles; 
when such labels as "voiced" and "voiceless" are used, the differences 
In operational definition are concealed, though they are no less real. 
The differences have their origins In the dissimilar approaches taken 
by linguists and speech researchers. The latter usually try to work 
with real mechanisms and models of processes whereas linguists more 
often concern themselves with relationships that they can formalize 
as rules, though exceptions are to be found on both sides. 

The Interface problem, then, has two different complexions: if 

linguists' rules really reflect underlying ir.echanlams and processes 

a claim that linguists rarely make for them and l£ current speech 

models prove to be tenable then a conversion is surely possible and 
finding It becomes an Important research goal. But It Is conceivable 
that the llngulstfit' rules are wide of the mark as to processes, h ow- 
ever useful they may be as descriptive devices. It would not then be 
possible to find the "real" conversion, though the search for it might 
make clear the directions in which plionologlcal theory ought to move. 
In any casa, the problem is Inescapable in some guise when automated 
language processing must operate across the boundary between speech 
and phonology. 
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Converting Generative Rules Into Analytic Rules 



The discussion of modelling speech production, including the 
special case of Interfacing speech to phonology, has all been genera- 
tive and most of the models that are concrete and believable are like- 
wise models of production processes. This does not imply that percep- 
tion has been less studied than production, but only that the research 
has yielded a more coherent set of relationships for the latter. 

There may be a good theoretical reason why this Is so: the pro- 
duction process Includes Important operations that are In principle 
lrreverslble---lrreverslble in the same senae that a drainage system 
would be irreversible, i.e., water does not ,run uphill and, if it did, 
it would not know which way to go at the confluence of two "downhill" 
streams. To the extent that speech perception is organized in motor 
terms and shares these irreversible operations, it cannot be expected 
to provide a model for straightforward analytic rules. Put another 
way, the production of speech involves encoding operations and so one 
must expect that the Inverse operations, like decoding in cryptography, 
will be Inherently complex and liable to ambiguity. 

An alternative view of speech perception does not link it to the 
motor system and so evades any need to run that machinery backward. 
It puts its dependence on auditory mechanisms, starting with feature 
detectors, and employs processes that are in principle descrlbable by 
models and analytic rull^s , though these have yet to be discovered. 

Clearly, the nature of speech perception is a central problem for 
speech research. Its relevance to computational linguistics, already 
discussed in connection with speech understanding systems, lies in how 
it affects one's choice of strategy in choosing methods for inferring 
the phonetic string from acoustic parameters, i.e., whether to stress 
analysis by synthesis with all its Inherent difficulties or to concen- 
trate on finding analytic rules, accepting the risk that they may not 
exist in any useful form. 
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Applications With Speech Involvement 



One can be reasonably certain that the practical applications of 
automated language processing will not lag far behind the development 
of a technical capability.^ It is easier to foresee examples that 
involve written-to-oral conversions than oral-to-written, so they will 
be discussed first. 

Reading Machines 

Synthetic speech as a reading service for the blind has already 
been discussed at length in terms of the research problems involved 
in setting up a central facility to make tape recordings from books. 
There is genuine need for such a service, especially for students and 
professionals. The practical objective is to have several service 
centers scattered across the country so that mailing delays will not 
be excessive. This will have to be preceded by a shakedown of the 
methods (still research oriented) and then some operating experience 
with a pilot center that uses production methods and equipment. It 
should be possible to accomplish both tasks within about five years, 
and to have begun the establishment of a network of service centers. 

An obvious extension is to allow local users to have on-line 
access to the text reading facility and, as a second stage, to make 
this access available by telephone. The latter would pose formidable 
problems if character recognition were performed centrally and only 
text scanning were done remotely. It may be, however, that newer 
methods of feature extraction, or total recognition of the printed 
text, will have been developed by that time, and so would make the 
data transmission problem quite manageable. Nevertheless, real-time 



^he current status of research and development in this area is 
reported in the Conference Record of the 1972 Conference on Speech 
Communication and Processing, April 24-26, 1972 , at Newton, Mass. 
[The Record is available from the National Technical Information 
Service or the Defense Documentation Center, AD 742236.1 





continuous processing poses very different problems from those of 
batch processing, some very similar to the problems encountered In 
real-time Interaction with a speech understanding system. 

Remote Retrieval of Information 

The same technology that reads for the blind can be used to 
allow quick access to library holdings by telephone from a remote 
location. Many of the local requirements for such a service will be 
met for other reasons In any case, so the additional investment need 
not be large. Thus, some types of library holdings (abstracts, 
blbl5,ographic Information, etc.) are increasingly being supplied and 
stored on magnetic tape, with programs that provide fast access to 
desired items. With a little help from the reference librarian, 
machine-readable information could be found and processed by the 
library's computer to yield synthetic speech which the remote user 
could listen to by telephone. Such a service will not answer all 
needs, of course, but it should be valuable in many Instances and it 
has the great virtues of requiring little by way of additional central 
facilities and of being able to use the existing telephone network 
instead of special terminals. 

An obvious elaboration is to allow the human caller to speak 
directly with the computer another application of speech under- 
standing systems. Even without this complication, however, there are 
Important linguistic problems involved in remote information retrieval. 
An obvious one is that much of the information now stored in machine- 
readable form is very "dense" in that it uses many abbreviations and 
graphic devices. Even connected text is often telegraphic in stvle. 
The reinterpretatlon of such information, now organized for the eye, 
to make it suitable for the ear becomes almost a condition for tele- 
phone access . 




Computer Assisted Instruction (CAI) 

Most CAI terminals operate solely with visual ou;tput and key- 
board input, not because these modes are always optimal but because 
other modes pose major technical difficulties. Speech output, In 
particular, would be highly desirable In many cases, and is clearly 
the method of choice for much of the interaction with children in 
the lower grades. They could benefit from a great deal of content 
instruction if it were presented orally, but they do not have the 
reading skills to cope with it visually. For older students, too, 
oral information would often provide a useful supplement to visual 
displays • 

This enhanced capability for CAI requires little more than 
adaptation of the text-to-speech techniques developed for the blind; 
in fact, the problem of providing good speech is easier, since the 
instructional text can be stored as a marked phonetic transcription 
that has been hand tailored to give natural sounding speech. More- 
over, the storage requirements — hence, the possibilities for truly 

interactive CAI programming are essentially the same as for literal 

text. Thus, the real utility of synthetic speech to CAI is likely to 
be far more dependent on imaginative programming than on technical 
limitations in providing spoken responses. 

The ideal arrangement, in adding a speech capability to CAI, 
would be to let the machine respond appropriately to spoken responses 
by the student. Special purpose solutions, comparable to digit 
recognition, might work very well in many cases , especially with 
older students. But the greater need, and certainly the greater 
technical challenge, lies in making it possible for the younger 
student to Interact in a reasonably free manner with his automated 
instructor. Clearly, this Involves all of the problems of speech 
understanding systems as currently envisaged, compounded with the 
technical problems of processing children's speech and the linguistic 
problems of dealing with their free-form syntax. As a practical 
matter, it would be a mistake to hold back on the use of speech as 




an output in the hope of an early solution to the input problem, 
despite the many advantages that two-way speech would have in 
enlivening the interchange and removing artificial constraints on 
instructional prograinning. 

Voice Typewriter 

The prognosis for typing or typesetting under voice control is 
probably no better than that for voice input to CAI. It is apparent 
by hindsight that the choice of the voice typewriter as an initial 
target for research on speech recognition was a serious error. Such 
a machine must deal with unrestricted inputs and a wide range of 
speakers and dialects. One can scarcely imagine a practical task of 
greater difficulty! However, both the nature of the problem and paths 
to intermediate goals that might lead on to an eventual solution have 
become much clearer. Thus, on the one hand, what we know of the 
nature of speech tells us that pattern matching will never provide 
a general solution, no matter how sophisticated the techniques; on 
the other hand, the use of a "side in** approach to speech understand- 
ing problems of limited scope promises to be one of the paths that 
might eventually lead to general speech recognition. 

Speech Understanding Systems 

The nature of the problems, the difficulties to be expected, and 
some of the areas in which research in both computational linguistics 
and speech can be helpful have already been discussed. The tasks in 
which speech inputs are to be used were chosen as demonstrations 
rather than practical applications. Even so, they are difficult 
enough to pose very real research challenges. 

The question is often raised about what, if any, really practical 
applications exist for voice input or, in more realistic form, what 
practical tasks there are that are not handled adequately by more 
conventional and less complex means . The ARPA Study Group listed 
some eight tasks as examples of practical applications; airline-guide 



information service, desk calculator (with voice input), air traffic 
controller, missile checkout (accepting spoken comments and questions 
from a human inspector), medical history taking, automatic protocol 
analysis, physical inventory taking (involving voice interaction with 
a human inspector), and robot management by voice. If .none of these 
seems of compelling urgency, it may be in part a reflection of the 
fact that our capabilities in speech recognition are still so primitive 
as to shackle our imaginations. 

Man*Machine Cooperation 

In summary, there are several practical uses to which speech 
outputs from automated language processing can be put, and probably 
will be put in the near future, though the prospect for practical 
application of speech inputs seems more remote. But one is tempted 

to say about speech recognition that, like Everest, it is there and 

eventually the challenge will be met. 

Man-machine cooperation with computers is already a fact of life, 
thou^ at present that cooperation can only be had on terms that are 
convenient to the computer. Again, one is tempted to say that such a 
state of affairs cannot continue; it is a safe prediction that man 
will insist on cooperation on his own terms. This means that computers 
must learn to listen as well as to talk. It will not matter much that 
this involves complexity and expense; if these were paramount considera- 
tions over the long term, we would all have telegraphs in our homes 
instead of telephones. 

Instrumentation for Research and Training 

The objective of the session's participants in their discussions of 
machines and speech was to consider not only promising areas for research 
but also needs and new possibilities for research tools. One suggestion 
that met general approval was that a state*of-the*art survey be commis- 
sioned to discover what the needs really are and to make generally avail- 
able a knowledge of recent developments in the leading research labora* 
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tories. Very often, new devices or software which are built to fill a 
local need do not seem to the investigators to be sufficiently imporr 
tant to justify separate publication. Hence, they remain unknown, 
except to a handful of visitors. 

This led to a discussion of how widespread the need might be for 
sophisticated new research instrumentation. The need is, of course, 
dependent on the number of centers in which basic speech phenomena are 
being studied intensively, and the prospect for additional centers. On 
this basis, instrumentation needs are comparatively modest quantitatively 
though crucial for the limited number of major research centers that do 

exist perhaps half a dozen in the United States, and comparable 

numbers in Western Europe and in Japan. The establishment of additional 
centers is made difficult by the "critical mass" (for both men and 
equipment) that i t> eded to do effective research; indeed, the increas- 
ing complexity of adequate tools and the need for cross -disciplinary 
approaches seem likely to increase the pressures toward centralization 
of research. There are, on the other hand, both a need for well-trained 
people, and a number of good academic centers where training could be 
much improved by enough research equipment to make that training modern 
and realistic. The sound spectrograph is one such tool that is now 
rather widely available. Speech synthesizers, on the other hand, which 
could be at least as useful, are rather rare. This seems unfortunate, 
especially since the technology is well known and the costs are not 
excessive. Thus, a good background for many of the kinds of speech 
research described in preceding sections could be obtained with a mini- 
computer plus disc file, obtainable for $10-15,000. The point was made 
that the much larger computers already available for batch or time- 
shared use at many universities are not adequate substitutes for even a 
mini-computer that can be used on-line; in fact, very few computer 
systems can handle speech, primarily because of the high, continuous data 
rates that are required. A state-of-the-art review could be particularly 
useful to schools that wish to install a training facility of the kind 
described, not only in alerting them to the possibilities, but in provid- 
ing detailed information that often takes a great deal of time to learn 



by trial and error — a familiar experience, summarized by one discussant: 
"the first program costs a year". 

Some of the new developments and trends to be expected in research 
instrumentation will be cheap mass memories of very large size, and a 
new order of magnitude in central processor speeds. There was general 
consensus that the trend toward interactive systems that operate in 
real-time will continue, with a large payoff in research productivity. 
Likewise, new facilities for graphic output are becoming easily available 
and will be most useful. 

In sunmary, although this part of the discussion found continuing 
progress and no urgent needs in the limited number of centers where most 
of the basic research on speech is done, it delineated a considerable 
need to upgrade awareness and training facilities in a much larger number 
of university centers in order to enable their graduates to become 
familiar with modem methods and problems. A state-of-the-art survey 
would be a useful initial step. 

Conclusion 

The group's discussions concerning Machines and Speech dealt mainly 
with the nature of the research problems that are encountered in incor- 
porating speech into computational linguistics. Two specific applica- 
tions reading services for the blind and speech understanding systems 

— were discussed at length. Both are examples of an increasing trend 
toward automated language processing and, in particular, extensions of 
this technology to the use of speech as an input-output modality. 

The group identified a number of specific areas in which there is 
strong interaction between the usual domains of speech and computational 
linguistics. Thus, for example, the synthesis of natural sounding 
speech, starting with written text, requires information about the 
placement of sentence stress, about durations, and about pitch contours. 
This is information that is implicit in the structure of written sen- 
tences; hence, good synthesis would seem to require a capability for 
parsing. Conversely, in attempting to infer sentences in written form 




from an oral Input, the suprasegmental information could provide much 
help in assigning structure to a string of phonetic elements. In 

general, many cf the problems and some very promising areas for 

research — lie in the conversions that must be made between language 
in written form and language in oral form. Thus, the addition of 
speech as input and output modes for automated language processing will 
necessarily focus attention on a whole set of problems that might 
otherwise pass unnoticed. 

There are other areas of speech research that also interact with 
higher level processes. Thus, efforts to build detailed models of 
the processes of speech production (and to apply them to synthesis) 
must start with a description provided by phonology, and so cannot 

ignore the interface presently missing between speech processes 

and phonological rules. 

Practical applications that make use of speech as an output from 
automated language processing are well on the way to being realized: 
reading services for the blind, remote retrieval of information by 
telephone, and a vocal response capability for computer assisted 
instruction. The prospects for processes that use speech as an input 
are more tenuous, though a major effort is under way to build demon- 
stration models of speech understanding systems, i.e., computers that 
will accept instructions or questions via microphone. For the long 
term, there is little doubt that man-machine interaction will become 
increasingly important in a practical sense, or that there will be a 
steady pressure on the machine to conform to human convenience, i.e., 
to learn to talk and to listen. 

The state of speech research here and abroad was also discussed 
briefly and it was noted that the trend toward concentration of research 
in only a few major centers is likely to continue because of the critics 
mass of men and instrumentation needed to deal with problems that are 
increasingly complex and multidisciplinary . But adequate research 
training need not be correspondingly concentrated; the provision of 
modest research facilities — in particular small computers used for 




synthesis studies could do much to broaden the base of research 

training. A state-of-the-art survey and prospectus would be a useful 
first step. 




LANGUAGE PERFORMANCE (PSYCHOLINGUISTICS AND DIALECTOLOGY) 
John B. Carroll, Chairman 



Although the scope of the topic assigned to this group was not 
defined in advance, the group took it to imply any application of 
computational techniques to problems of describing or modelling verbal 
output in actual communication situations* While such a scope could 
include the use of computational grammars and parsing techniques as 
discussed by other groups at the conference it was agreed that an 
emphasis would be placed on what kinds of information could be 
generated by the application of such techniques, assuming that the 
techniques had already been brought to some level of usefulness (rather 
than the specific problems of perfecting such systems). 

The initial difficulty of setting the ground rules for discussion 
was further compounded by the fact that even the notion of ''language 
performance" is not well defined, for in many ways it is related to 
the notion of "language competence", in the sense that the rules of 
"language performance" depend upon the rules of "language competence" 
and to a large extent may be identical. 

Likewise, psycholinguis tics is an ill-defined term. Some would 
apply the term to any scientific study of the use and understanding of 
language in language users; others would restrict it to the study of 
such matters as the special ways in which particular aspects of 
linguistic systems (phonology, syntax, etc.) are acquired and used. 
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It was recognized that computers are already very widely used in 
psychology, in the statistical analysis of psychological data, in the 
generation of stimulus arrays and experimental designs, and in "on-line" 
experiments in which subjects interact with computer-generated displays. 
(For references, see, for example, Cooley and Lohnes , 1971; Veldman, 
1967; Tepas et_ al . , 1972.) However, it is rare that any techniques 
similar to those of computational linguistics are involved in such uses; 
i.e., it is rare that rules relating to language structure are involved 
here. Nevertheless, it is difficult to draw the line; e.g., would a 
computer program for generating random strings of phonemes under certain 
linguistic constraints be regarded as falling under the heading of 
"compu ta t ional 1 inguis tics "? 

The topic of dialectology was treated rather summarily, almost as 
an afterthought, partly because most techniques thus far developed in 
computational dialectology are essentially "bookkeeping" techniques 
used to store and categorize data. However, several interesting ideas 
which were brought up by participants will be discussed in the report. 

Under very loose ground rules that mere "bookkeeping" uses of 
computers would not be considered, the group proceeded to survey areas 
that had already received some study and/or that might hold promise for 
future work. 

Studies of Linguistic Frequencies 

Dangerously close to "bookkeeping" uses of computers , but still 
holding some possibilities of theoretical interest, are studies concerned 
with the distribution of linguistic elements with respect to frequency, 
i.e., what is the relative frequency of each of a given set of linguistic 
elements such as words, morphemes, phonemes, phoneme sequences, grammat- 
ical structures, etc.? Are there interesting ways in which to charac- 
terize these distributions? Can inferences be drawn from finite samples 
as to the characteristics of populations of infinite size? Work along 
these lines derives from that of Zipf (1935, 1949) and his predecessors 
(Estoup, 1916; Condon, 1928). Zipf studied various kinds of frequency 
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distribution, frequencies of words, word lengths, Chinese characters, 
etc. and claimed that a certain law would characterize them, namely 
fr ■ k, where ^ is the frequency of a given word type, x its rank 
(the number of items that occur ^ times or more), and k is a constant 
depending on the sample size* (This discussion is considerably over- 
simplified; for a more detailed discussion, see Miller & Chomsky, 
1963, pp. 456-464.) Three major lines of further development are 
those of Mandelbrot (1961), who elaborates Zipf's function, Simon 
(1955), who generates a certain stochastic process, and Herdan (1960), 
Howes (1971), and Carroll (1969), who use a lognormal distribution or 
variants of it. In particular, Carroll and Howes have developed 
methods of estimating population distributions from sample distribu- 
tions. The computations involved in all these applications are com- 
plex and require the use of high-speed computers. There still remain 
serious problems as to the generality and fit of the mathematical 
models that are claimed. 

Computers are also used, of course, in the setting up of the 
raw frequency distributions from large corpora of data, but the 
sorting methods used here are simple and resemble those used in the 
development of concordances (Lamb & Gould, 1964). Among examples of 
extensive frequency counts done by computer are those of Kucera and 
Francis (1967), Carroll, Davies, & Richman (1971), and Roberts (1965). 
The first two of these counts were of the frequencies of words defined 
as graphic sequences; the Roberts count was of phonemes and their 
transitional probabilities, derived from a 'liand-made*' phonemic izat ion 
of the word-types in a previously available corpus. One serious 
problem that has occurred in the graphic word count studies is that 
the results do not distinguish homographs, and fail to group the 
words by lemmas. Juilland and Chang-Rodriguez (1964) solved this 
problem by a hand post-editing of the computer output, but a better 
solution could be obtained if a highly accurate computer parsing 
procedure were available to be applied to the source corpus. 

A related problem is that of deriving rules for English spelling- 
sound correspondences, either the rules for going from phonemes to 
spellings (Hanna et al . , 1966) or the rules for going from spelling 




to phonemes (Venezky, 1970). Computers have been used extensively in 
this work, either for inducing rules from large corpora or pairs of 
printed words and their phonemicizations , or for testing rules on such 
corpora. Again, because of the anomalies of English orthography, the 
success of this work has thus far been limited by the difficulty of 
attaching appropriate semantic and grammatical readings for the words 
involved; presumably adequate parsing techniques would be of great 
help here. The practical importance of this work is that it would 
aid in the preparing of materials in the teaching of reading and 
spelling; further, it would supply links in any complete computerized 
system for going from speech to print or from print to speech. 

Use of Computers in Describing and Analyzing Language Output 

Many types of psychological investigations involve the eliciting 
or gathering of large quantities of verbal output. The sources and 
purposes are varied; examples are: 



TAT (Thematic Apperception Test) protocols from 
individual subjects, to diagnose aspects of 
personality and motivation (Murray, 1938; 
McClelland et al . , 1953). 

Psychiatric interview data, to study themes in 
patient thought content or to gauge progress in 
therapy (Iker & Harv?ay, 1965). 

Folk tales from different cultures, to study 
common elements in thematic content. 

Political speeches, propaganda, and the like, to 
assess political attitudes, subtle changes in 
propaganda **lines*', etc. 

Teacher-student interactions in the classroom, to 
study ''classroom climate" and factors underlying 
effective teaching strategies (Bellack et al. , 
1966; Loflin et al . , 1972). 



In the past, verbal outputs of these kinds have been studied 
primarily by subjecting them to elaborate and time-consuming hand- 
coding schemes, but with the advent of the computer there has been 




Increasing Interest In automating these coding processes. One very 
general methodology that has been developed, based largely on computer 
techniqucis. Is knovm as the General Inquirer program developed by 
Stone, et^ al.» (1966), Essentially, this program Is based upon the 
construction of specialized dictionaries wherein particular words 
are coded for specified themes or contents; the computer processes a 
text and counts the number of words associated with each of a number 
of thenies as specified by the dictionary. In early forms of this 
program, words were counted simply as .graphic shapes without regard 
to their actual meanings (which might count, for example, the word 
storm under the theme aggression when It referred to a weather 
phenomenon). Later versions of the program have included some 
attempted to disambiguate meanings in terms of context and grammatical 
form (Kelly, 1970), but It cannot be said that these attempts have 
utilized any truly sophisticated parsing rules, 

A somx^what more sophisticated use of the computer to parse 
verbal output has been developed by Professor Baldwin of Cornell 
University (personal communication). In his studies of mother-child 
interactions, observers are taught a limited language for describing 
these interactions; it has been found possible to parse the resulting 
descriptions and thus obtain counts and other analyses of the volumi- 
nous interaction data thus obtained, 

Barron (1970) developed a computer program for identifying and 
counting linguistic elements classified according to Fillmore^s case 
grammar, and demonstrated sex differences in the verbal styles of men 
and women teachers , 

Page and Paulus (1968) attempted to develop a computer program 
that would grade student-written essays as accurately as teachers do. 
Darnell (1970) developed a program for measuring the "clozentropy" of 
responses of foreigrt students learning English when they attempted to 
guess missing elements in English sentences. 

This is only a very partial and selective listing of work in this 
general area. While all these programs have been partially successful. 




it is believed that work of this type could benefit very much from 
systems of language analysis that might be developed through a deeper 
understanding of language construction, i.e., the computer parsing 
techniques discussed by other groups at this conference. 

Studies of Semantic Networks and the Subjective Organization of 
the Lexicon 

One of the first major attempts to develop a quantitative approach 
to semantics was that of Osgood, Suci, and Tannenbaum (1957), embodied 
in the technique known as the semantic differential . The computational 
techniques were primarily those of factor analysis, applied to the very 
large data matrices of responses of subjects to semantic rating scales. 
A kind of zenith in this kind of work was reached in the atlas of 
semantic differential scalings published by Jenkins, Russell, & Suci 
(1958). Semantic differential data have to do> however, primarily with 
the connotatlve aspects of "meaning"; generally, this connotative space 
is found to contain three main dimensions > evaluation, activity > and 
potency. These dimensions probably have little to do with the semantic 
space of denotation. In more recent years , psychologists have attempted 
to determine the characteristics of denotational semantic spaces, 
largely using multidimensional scaling techniques. The work of 
Fillenbaum and Rapoport (1971) may be cited in this connection. 

Other approaches to semantic analysis are represented by the work 
of Minsky (1968), Quillian (1969), Simmons et al . (1968), Reitman (1965), 
and others, all being concerned with how semantic information can be 
represented in computer memories in such a way that it can be accessed 
and manipulated in such "artificial intelligence" tasks as question- 
answering, determining semantic similarity, and simulating a dialogue. 
Psycholinguists have taken some interest in these formulations, but thus 
far the techniques they have employed to test hypotheses about semantic 
organization have utilized more conventional experimental settings 
(measurements of reaction time, etc.) rather than computer technology 
(e.g., Collins & Quillian, 1969; Myer, 1970). In fact, such techniques 
are probably highly appropriate for these studies, although one can also 



envisage a role for computer technology in interactive bettings in 
which, for example, an individual's semantic networks are compared 
with those specified in a computer model. 

Modelling of Speaker and Hearer Behavior 

Psychologists have shown much interest In attempting to simulate 
human behavior » particularly higher mental processes involving language » 
concepts, and problem solving (e.g.. Hunt, 1962; Reltman, 1965; Newell 
& Simon, 1972). Usually these attempts are formulated as computer 
programs* Simulation Is viewed as essentially different from "artifi- 
cial intelligence" in that the former is concerned with attempting to 
replicate, in the computer, the actual sequence of Information pro- 
cessing activities that takes place in the human organism In learning 
a concept or solving a problem, whereas artificial Intelligence is only 
concerned with procedures whereby the program can achieve, by any means 
found feasible, a result that resembles or even surpasses (in quality, 
speed, or accuracy) what could be achieved by human intelligence. 
There are serious problenis In "validating'* computer simulations because 
the actual processes being simulated are accessible to observation only 
Indirectly, if at all. It can be proved mathematically that a given 
observed result can be achieved by a multiplicity (I.e., at least two) 
of different operational procedures; thus, the choice of which procedure 
Is actually being followed by a given human subject on a given occasion 
Is theoretically indeterminate. Nevertheless, persons working in the 
field of computer simulation believe that this type of work will help 
to narrow down the alternative possibilities. 

The group discussed briefly the possibilities of simulating various 
types of language behavior. It was recognized that any such simulation 
would have to be based on an exceedingly well -organized and serious 
theory of language behavior. Including not only rules for language 
competence but also for language performance. Nevertheless, the group 
was impressed with the advances that have been made with certain simple 
simulation v^^ograms such as that of Uinograd (1971), wherein a robot 
accepts English sentences and performs corresponding actions in locating 
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and picking up objects such as colored blocks. Kaplan's (1971) 
application of augmented transition networks to psycholinguistic 
Investigation also seemed very promising. 

Mention may be made of some partially successful attempts in 
this area. Uhr (1964) developed a program that could learn to 
"translate" from one language to another by comparing strings. 
Siklossy (1971) has recently developed a program called ZBIE which 
(according to the author) "accepts tree structured descriptions of 
simple situations, and improves its capacity to express these 
situations in the natural language that it learns." Siklossy gives 
examples of how either English or Russian sentences can be learned 
by the program in this way, and claims that his results "support the 
hypothesis that language-learning need not require postulated 
separate language -acquisition devices." 

Ronald Kaplan reported unpublished work of Donald Olivier at 
Harvard dealing with the question of whether a computer program 
could "learn" to segment natural speech (e.g., into words) by use 
of frequencies of occurrence and transitional probabilities. Con- 
siderable success had been achieved by such a program, lending 
further support to the notion that the child as language learner 
needs only relatively limited acquisition devices. 

Dialectology 

In discussing possibilities of computer use in dialectology, 
the group noted that up to now, most work has concentrated on 
archiving information, computing indices of lexicographic similarity, 
and the like. The computer can indeed be of much use in such 
endeavors, but is largely merely a substitute for clerical work. 
Joseph Grimes reported that he had developed a program for accepting 
dialect information and plotting dialect areas. Some reports of the 
uses of computers in dialectology are to be found in Garvin (1965). 
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It was cotonented that In view of the large Interest in American 
regional and minority dialects and the voluminous data that might be 
involved in any thorough study of such dialects, use of computers 
would be almost mandatory to achieve useful results efficiently. 

Some mention was made of the possibility of using computers in 
glottochronological studies in which masses of data from different 
members of a language family would be analyzed for common elements. 
Some work of this sort has already been done (Carroll & Dyen, 1962). 
Mention was also made of the possibility of simulating language 
change, e.g., testing the operation of sound-law rules in both well- 
known (Indo-European) and less well-known language families. 

Conclusion 

It was felt that psychological and psycholinguis tic studies 
could profit enormously from the theories and techniques now being 
developed in computational linguistics. Few psychologists are 
Sufficiently aware of these possibilities. 
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SOCIAL IMPLICATIONS OF AUTOMATIC LANGUAGE PROCESSING 
Donald E, Walker, Chairman 



When computational linguistics first began to be Identified as a 
field of endeavor about a quarter of a century ago, the social Implica- 
tions contemplated were only positive ones* Computers would process 
language a tuomatlcally to provide mechanical translation (Weaver) and 
talking typewriters (Bush) or at least reactive ones (Mooers), and the 
consequences envisioned for society promised both International and 
trans-cultural cooperation and Individual enhancement and augmentation* 
Recently, however, concerns about the Impact of science on society have 
resulted In a sharp emphasis on the dangers that might result from 
computational linguistics* Invasion of privacy Is envisioned, through 
listening In on telephone conversations or through the ability to 
address sophisticated queries to the Increasingly comprehensive data 
files that are being accumulated* What seems needed at this time Is 
a perspective both on the positive and on the negative Implications of 
computational linguistics* What do we need to be concerned about? How 
can we guide the direction of the technological applications of our 
scientific advances? The discussions at the session on Social Implica- 
tions of Automatic Language Processing were addressed to these questions* 

There are two major contexts within which to examine the appllcatlor 
of computational linguistics* The first Is provided by technological 
developments outside of the field, specifically, the proliferation and 
Increasing availability of Interactive terminals* Embedding similar 
devices In "the wired city" and allowing access to the capabilities of 



the computer through natural language over telephone lines or through 
radio and television links could have remarkable consequences for the 
life of the average citizen* The experiments In Reston, Virginia, 
make these possibilities more real. 

The second major context for consideration Is that of social 
needs. How does computational linguistics relate to contemporary 
urban problems or to the newly recognized responsibilities to people 
In Isolated communities? Literacy and communication still are major 
problems* Social planning requires Increasingly larger and more 
complex Information bases for decision making* And there Is Increas* 
Ing recognition of the right of the people both to know and not to 
be known without their knowledge. How and where does computational 
linguistics fit In? 

It Is reasonable before considering these questions directly to 
look more closely first at the scientific enterprise. And the Initial 
topic for group discussion was In fact the sources and effects of the 
funding practices of the Federal Government* Quite apart from any 
specific Information about technical content or projected goals, the 
Identity of the agency supporting research can prompt attitudes toward 
the work. For example, some participants noted that money from the 
Department of Defense has been viewed as bad In principle, an Interpret 
tatlon complicated and confounded by the Mansfield amendment which has 
resulted In odd discrepancies between the descriptions of a project 
made by a university recipient and by the particular government agency 
sponsoring It. 

One recommendation strongly urged was that all research funds (or 
at least those now handled by the Department of Defense) be provided 
through the National Science Foundation. Although the merits of such 
a proposal were recognized. It was argued that the Congress would 
never allocate as much money through NSF, so one consequence would be 
a drastic reduction In support for research* Equally compelling, 
however, was the argument that massive funding from a single source 
would Inevitably narrow the range and variety of projects likely to 
be approved. Moreover, the other agencies need to have a knowledge 
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of the basic sciences directly available In order to make Informed 
decisions In support of development and engineering Implementations 
over both long and short periods • 

There Is another effect of funding practices, less often recog- 
nized explicitly and with narrower "social" Implications, specifically 
those for the particular scientific community* Massive concentrations 
of funding on a particular problem can have consequences for a field 
of Inquiry. Support of mechanical translation In the late 1950's and 
early 1960 's was presented as a case In point* Pressures toward a 
practical product In the absence of parallel support for the research 
essential to Its achievement cannot succeed, and the consequences for 
computational linguistics because of the unreasonable expectations 
raised by the MT program were significant. 

The Advanced Research Projects Agency's recent program to develop 
prototype speech understanding systems was given careful consideration 
by the participants for this reason. The assertion has been made that 
funding for this one activity Is at least as large as the current 
support for all other speech research. If not larger. However, it was 
pointed out that the comparison may be misleading, because the ARPA 
program cuts across the full spectrum of language. Including semantics, 
syntax, and morphology, as well as acoustics, phonetics, and phonology, 
and because It Involves modeling the breadth of language capabilities. 
Nevertheless, the appropriateness of the concern was recognized; It Is 
essential that some balance be kept, so that speech understanding can 
build on a continuing base of research. 

One special problem needs to be kept In mind In this context* In 
the presence ot large research programs, particularly attractive ones, 
there Is a tendency (pronounced. If not always well articulated) for 
scientists to adjust their Interests a little too much In accord with 
these programs. Consequently, funding agencies may find It necessary 
to overcompensate In making their decisions about projects proposed, 
dellberar.ely supporting those that are not ''In the mainstream". 
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The preceding remarks have Involved narrow Interpretations of 
the term social, although ones too often Ignored In discussions of 
the Implications of science. The major concern of the session 
certainly was with the effects of automatic language processing on 
the larger society. Consideration was given first to more positive 
Implications . 

Enhancement of communication still Is expected to be a major 
consequence of successful work In automatic language processing. 
Mechanical translation can contribute to International communication 
or at least communication across national borders. There are func- 
tioning programs In ffT, somewhat crude but both usable and used. The 
most Interesting near-term prospect In this area Is for computer aids 
to translation. Coupled closely with attempts to capture and model 
the translator's actions, this approach appears particularly promising. 

Communication can also be considered with a more local focus. In 
the context of existing systems for personal file handling, the pros- 
pects for augmentation of human intellect (Engelbart) loom particularly 
large. Question-answering systems are laboratory exercises at this 
time. However, to the extent that they can be brought Into Interactive 
time-sharing systems, users who are specialists in technical areas that 
involve textual data should be much more efficient in their explorations 
through the existing literature and much better able to relate, edit, 
and even create documents in ways that add to the knowledge in their 
specialty. 

The procedures that underlie the "augmentation of human intellect" 
of ihe research scientist can become part of the increasing automation 
in the library. The current experiments (e.g., Intrex) have made very 
little use of the potential of computational linguistics (and appropri- 
ately so). However, the critical barrier is the willingness of the 
library user to tolerate and , more importantly, to explore the potential 
of computer assistance. If bibliographic support proves satisfying in 
the library context, we should increase the sophistication of the proce- 
dures toward fact retrieval, with significant consequences for the 
effective dissemination of knowledge. 




The same kinds of computer capabilities that are relevant for 
augmenting libraries and human Intellects can be applied directly to 
education and to the Important social Issue of literacy which has 
such significant Implications for participatory democracy. Few 
current computer-aided Instruction techniques make use of computa- 
tional linguistics. The possibility for meaningful and constructive 
dialogue with a student requires a degree of sophistication that can 
be provided only by techniques that penetrate beyond form into content. 
Moreover, such dialogues cannot be based on predetermined scenarios; 
they require the possibility of alternating questioning and answering 
phases from both student and program. Prerequisite for the development 
of any such capabilities is a comprehensive computational analysis of 
language . 

From educational implications it is a natural extension to a 
consideration of the consequences of automatic language processing 
through Interactive terminals in "the wired city". As remarked at 
the beginning of this section, through such a capability the citizen 
would have access to substantial computer power. Whether he is able 
to use that power depends critically on its accessibility through an 
interface that respects his customary modes of interaction. Techniques 
for automatic language processing must respect habits that are language- 
based, if systems are to be responsive to the general public. 

The technical particularities of computational linguistics that 
would make for revolutions in education and in people's dally lives 
were not elaborated on in the discussions. Treated with somewhat more 

substance but still with little in the way of specifics regarding the 

relevant hows of automatic language processing — were the implications 
for the medical and legal professions. Ellcltatlon of medical histories 
multiphasic health screening, the storage and retrieval of medical Infor 
matlon, the more precise determination of variations in symptom patterns 
and diagnostic indices through processing large quantities of patient 
protocols, perhaps even diagnosis Itself, all might be enhanced through 
automatic language processing capabilities. Comparable statements were 
made about the legal profession. Such fmhancements certainly would make 
the practice of law and medicine more efficient and more effective and 
could he extended to other areas . 
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However, along with the positive social values associated with 
the potentials of computational linguistics and automatic language 
processing, a number of possibly negative consequences were considered. 
One of the most frequently cited relates to the current propensity In 
our society for aggregating Information about Individuals In central- 
ized files. To the extent that natural language query capabilities 
are developed, these files could be accessed In violation of a person's 
privacy. Even If the retrieval strategies prove to be relatively 
unsophisticated. It Is possible that personal files would be structured 
to reflect only a relatively small class of standard or stereotyped 
Information Items about an Individual. The resulting Impersonality 
might be valuable for statistical overviews, but It would blur meaning- 
ful distinctions among people. 

The area of speech processing, recognition, and understanding again 
provides an Illustration. Although voice authentJxatlon might prove 
valuable for the banking and credit card Industries > It Is viewed less 
sangulnely In relation to eavesdropping and personal surveillance. The 
possibility that •••freedom of speech^^ could provide ••instant evldence^^ 
looms as a serious threat to personal security, particularly with a 
technology whose accuracy leaves something to be desired, i^holesale 
recording of conversations Is less attractive If It requires t^asslve 
amounts of time to discriminate relevant from Irrelevant segments. 
Screening techniques that just Identified the presence of human voices 
would be helpful. If, further, It proved possible to Identify particular 
human voices, and then to discriminate when they were talking about 
particular topics, the Implications for the violation of human rights 
would become more and more Important to consider. 

Another kind of violence to human rights also needs to be considered, 
this one from the context of education. The use of crude and Inadequate 
capabilities for automatic language processing In programs for computer- 
assisted Instruction could lead to what one participant termed ••robbing 
a generation of students of their Intellect^*. It certainly seems that 
CAI has been relatively unproductive, but It has also been obvious, even 
to casual observers, that the particular programs used have been limited 
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in scope. Introducing complex but inadequate — linguistic interfaces 

could lead to expectations of competence that are not justified. 

In the brief review of possible negative social consequences of 
automatic language processing, it was apparent that in some kinds of 
applications the dangers arise from success, in others from partial 
solutions applied prematurely to the scientific and technical problems 
involved. The {.".rticipants also agreed that the professional involved 
in computational linguistics has a responsibility to illuminate both 
the problems and the potentials of his resiiarch. 

Economic factors are relevant in these considerations with impli- 
cations that may be variously positive or negative. Will automated 
language processing techniques increase or reduce the overall cost of 
computer-based facilities? If such systems are commercially feasible, 
will they be priced so that they are available only to particular 
socio-economic groups? Will they serve to increase or reduce conflict 
among these groups? These questions are not unique to computational 
linguistics, but they do merit careful examination. 

Science exists in and necessarily has an effect on society. It 
is unlikely that a more perceptive view of the consequences of develop- 
ments in atomic energy would have precluded the development of the 
atomic bomb. Similarly, work In so-called genetic engineering will be 
pursued In spite of the potentially disastrous consequences that have 
been projected. What seems essential is to demarcate clearly between 
what we know and what we do not know at any given time. 

In recognition of the fact that social and political factors could 
coerce what might be considered premature use of automatic language 
processing technology, it was suggested that it might be necessary to 
increase support in certain areas &o that distinctions can be made 
between science and "black magic'*. As a case in point, recent court 
decisions have sanctioned the use of **voiceprint** identification as 
evidence in legal proceedings. Since there is no consensus among 
specialists in phonetics and acoustics regarding the reliability of 
such identifications, it is essential that further research be conducted 
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to establish the limits of credibility. However, one probable 
consequence of this research Is the development of more precise 
techniques for the Identification of specific Individuals on the 
basis of voice characteristics. 

There is no way of removing science from society. What is 
needed rather is to make more explicit and more accessible to the 
public the state of scientific knowledge at a given time. However, 
the area of computational llng;ilstics has one characteristic that 
could make further developments society-serving in a particularly 
significant way. For a natural language Interface to a computer 
can make data available to the public as well as to the government 
agency or the computer specialist. Further research certainly will 
result in simplifying access to computers by nons pec la lists and in 
increasing the availability of computational capabilities. Social 
and political changes will be needed to allow for the most productive 
use of this accessibility. However, these changes do seem to be 
consonant with our country's movement toward a more open and more 
democratic society. Computational linguistics could contribute to 
our ability to treat each individual more uniquely, respecting more 
clearly his individuality. Accordingly, it does seem that the most 
significant social implication of automatic language processing can 
be that of providing "all computer power to the people'*. But compu- 
tational linguists must make clear the dangers as well as the potential 
inherent in that vision. 
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PROFESSIONAL STANDARDS, ETHICS, AND EDUCATION 
Sally Y. Sedelow, Chairman 



During the sessions on professional standards, ethics, and 
education, the participants spent more time discussing the question 
of education for the computational linguist and seemed less concerned 
with other topics* This report reflects that emphasis* 



Professional Standards 



One of the problems that were brought up was the desirability 
of devising a system for evaluating or refereelng papers or technical 
reports which are sent around In mimeographed form* Such papers and 
reports sometimes create the Illusion and, ultimately, reality of a 
sanctioned tradition, a canon, when It may be both undeserved and 
undesirable from the point of view of the profession as a whole* In 
the course of this discussion It was noted that computational linguis- 
tics Is In a somewhat unusual position Inasmuch as there Is no single 
journal devoted exclusively to this area* Currently, computational 
linguists choose among journals such as Language , Computer Studies In 
the Humanities and Verbal Behavior , The Journal of Verbal Learning 
a nd Verbal Behavior , Computers and the Humanities , and Behavioral 
Science* It was noted that the Association for Computational Linguis- 
tics had looked Into various publishing alternatives and no single 
satisfactory solution was found* On the other hand. Mechanical Trans- 
lation , past publication of the Association for Computational Llnguls- 
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tics, folded up because it was not receiving a sufficient number of 
quality papers to sustain a regular schedule of publication. 

One suggested resolution to the standards problem was the 
establishment of a panel of referees who would evaluate abstracts 
and articles which are distributed through informal channels. There 
was considerable objection to the prospect of evaluating abstracts 
but some positive response to evaluating articles. 

The desirability of good program documentation and program- 
testing was also stressed, but no mechanism for assuring high stan- 
dards was suggested. 

Professional Ethics 

Some professional associations, e.g., the American Psychological 
Association and the American Bar Association, have drawn up statements 
concerning ethics and standards. For example, one of the ethical 
issues with which both psychologists and lawyers are concerned is the 
preservation of confidentiality. Although some felt that a statement 
on ethics and standards for computational linguists might well be 
explored, nothing concrete as to either the content or the mechanism 
for drawing up such a statement emerged from the sessions. 

Education 

Most of the session participants' time and energy went into 
attempting to identify a core body of knowledge which a practicing 
computational linguist might Ideally have. Participants noted that 
the areas listed need not imply courses, although courses might be a 
useful way to satisfy some or all of these needs. Nor, if courses 
were being considered, should each topic suggested necessarily imply 
as nuch as a semester or quarter term course. It did not seem 
fruitful to the participants who discussed this topic to try to 
specify programs appropriate for the B.A., M.A., and Ph.D. degrees, 
nor, in fact, to specify content of courses. Such prescription was 
avoided because variation among institutions is great as to academic 




areas (e.g., no linguistics department, no statistics department, etc.) 
and as to preparation of students deemed acceptable for any given 
degree program. For example, many linguistics departments assume that 
Lhe student entering the M.A. program has had very little linguistics. 
Or, as another example, students with a background in computer science 
who desire to be computational linguists require supplementation in 
areas different from those required by, for example, someone from 
English. It is possible that in time, separate programs or departments 
in computational linguistics will emerge. But, for the present, it 
seemed most desirable and feasible to designate areas of knowledge in 
which a computational linguist should have competence. One other 
caveat is probably worth mentioning: that is, many currently practicing 
computational linguists do not have the range of knowledge represented 
by the areas listed below. Such a situation almost always exiots when 
a new academic area, or emphasis, is emerging and the session's 
participants felt that academicians and scholars who see certain needs 
---even those which they do not themselves fulfill-— should, at this 
point, take comfort in their liberality of spirit and vision rather 
than feeling demeaned for falling short of the ideal. Actually such 
a situation exists in the long-established disciplines as well. 

The core body of knowledge given below was compiled from infonral 
suggestions and therefore should not be looked upon as exhaustive either 
in its contents or organization. 

A. Formal Theory and Methods 

Discrete mathematics (e.g., finite graph theory, combinatorics 
(elementary), semi-groups. Boolean algebra) 

Logic (e.g., propositional and first order predicate calculus), 
naive set theory (together with a sample set of axioms) 

Introductory Automata and Computability Theory 

Formal Grammars and Languages 
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Computer Scienc e 



Computer programming as a skill, including at least one high 
level programming language. It is assumed that the intro- 
duction to computer programming also will provide an intro- 
duction to basic computer organization and to an awareness 
of, and experience with, a range of processing modes (e.g., 
interactive, batch, remote batch). 

Data Structures and File Management 

An introduction to the nature and range of programming lan- 
guages, including some minimal experience with several 
programming languages. 

Knowledge of how to construct as well as practice at construct- 
ing compilers 



Linguistics 

Phonology (stressing morphophonemics with less emphasis on 
phonetics and phonemics) 



Rule-writing techniques (tricks of the trade) for grammars 

An understanding of the concept of modeling natural languages 

Survey oi* theory available to the computational linguist 

"Structure" course in some language (e.g., English or a non- 
Indo-European language;, some participants felt that it must 
be a non-Indo-European language and others felt that as long 

as the course emphasized structure, even English assuming 

the native speaker were English — would be acceptable) 



Computational Lin?.;uis tics 

Knowledge of the state of the art (including heuristic/artifi- 
cial intelligence approaches) with a thorough knowledge of 
three or four prototype systems 



Syntax 



(morphology and discourse analysis, associated 
with both these areas, should be included) 



Semantics 
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The areas specified above, under A through D constitute the core 
recommended for a practicing computational linguist. In addition, some 
computational linguists might well choose to concentrate on a given 
area of specialization, implying, of course, additional knowledge. 

For example, computational linguists specializing in speech must 
know experimental phonetics, experimental psychology, the calculus 
through differential equations, and signal processing. 

Other area specializations might include: 



Other computational linguists might well do ancillary work (this 
need not imply an emphasis, although it might) in such areas as: 



Text Processing 
Statistics and Probability 
Lexicography 
Computer Architecture 
Computer Software Systems 
Computer Graphics 



Ps y ch ol inguis t ics 
Sociolinguistics 
Anthropological Linguistics 
Historical Linguistics 
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